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Introduction 


Matrix  metalloproteases  (MMPs)  are  zinc-dependent  endopeptidases  that  degrade 
extracellular  matrix  components.  Two  of  these  proteinases,  MMP-2  and  MMP-9,  also 
known  as  gelatinases  A  and  B,  respectively,  are  involved  in  breast  tumor  metastasis  and  the 
process  of  neovascularization.  Evidence  indicates  that  the  activity  of  gelatinases  is  critical 
for  breast  cancer  tumor  invasion  and  angiogenesis.  We  have  undertaken  a  multidisciplinary 
research  approach  for  inhibition  of  the  gelatinase-mediated  tumor  cell  invasion  and 
angiogenesis  using  the  first  novel  synthetic  mechanism-based  inactivators  targeted  for 
gelatinases.  These  inactivators  are  expected  to  show  inherent  high  selectivity  for 
gelatiriases,  as  previously  demonstrated  by  us  with  similar  types  of  inactivators  for  other 
families  of  metalloproteases.  The  efficacy  and  selectivity  of  the  inactivators  will  be 
evaluated  with  purified  preparations  of  recombinant  gelatinases  A  and  B  (MMP-2,  MMP- 
9),  stromelysin-1  (MMP-3)  and  collagenase-3  (MMP-13).  Provisions  have  been  made  so 
that  the  best  inactivator  of  our  design  would  be  evaluated  in  in  vitro  invasion  and 
angiogenesis  assays,  as  well  as  in  animal  models  for  breast  tumor  metastasis  in 
collaborations  with  other  groups  in  the  foreseeable  future. 

Body 

We  had  reported  several  inhibitors  last  year  that  showed  the  basic  desirable  traits  of 
mechanism-based  inhibition  for  gelatinases.  The  problem  rested  with  their  relative  poor 
affinity  for  the  enzyme.  We  had  to  increase  the  affinity,  if  these  strategies  were  to  be 
successful.  We  have  gone  back  to  the  computational  models  for  inhibition  of  gelatinases 
and  have  designed  new  structural  motifs  for  the  inhibitors  that  we  hope  would  show  the 
mechanism-based  inhibition  characteristic,  as  well  as  high  affinity. 

The  computational  models  commence  with  the  predicted  three-dimensional  structure 
for  gelatinases  A  and  B  that  we  reported  recently.1  These  models  serve  as  the  centerpieces 
of  the  design  work.  We  hasten  to  add  that  we  have  made  available  to  the  scientific 
community  these  models  and  others  at  our  group  Web  site 
(http://sun2.science.wayne.edu/~somgroup/).  Our  Web  site  has  had  over  1000  visitors 
since  late  in  October  of  1999,  when  we  started  recording  the  number  of  visitors.  These 
visitors  often  connect  to  our  Web  site  to  download  the  coordinates  for  our  models.  Dr. 
Dan  Ortwine  of  the  Parke-Davis  Pharmaceutical  Company  recently  wrote  to  us  to  report 
that  they  have  determined  the  X-ray  structure  of  the  catalytic  domain  of  gelatinase  A,  and 
that  it  matches  our  model  for  it.  Hence,  the  models  are  highly  reliable,  and  they  are 
obviously  of  use  to  many  research  groups. 

In  an  aspect  of  our  modeling  of  these  enzymes,  we  have  looked  at  the  diversity  of 
the  sequences,  and  by  consequence  diversity  of  functions  of  MMPs.  In  the  most  extensive 
study  of  its  kind,  we  just  reported  a  both  sequence  analysis  and  modeling  effort  to 
understand  function  of  the  structures  of  over  60  MMP.2  These  studies  indicated  that  the 
accepted  dogma  for  MMPs,  that  the  multiple  domains  of  these  proteins  assembled  to  give 
diversity  of  function  was  not  valid.  The  results  indicated  that  assembly  of  the  domains  was 
indeed  a  very  early  event  in  evolution  of  these  enzymes,  and  diversification  of  function 
took  place  after  such  initial  assembly.2  Furthermore,  we  have  discovered  four  distinct 
motifs  for  metal  binding  in  MMPs.3  It  is  interesting  that  the  three-dimensional  structures 
are  strictly  conserved,  despite  the  diversity  in  motifs.  These  indicate  that  after  assembly  of 
domains,  nature  went  in  many  different  evolutionary  tangents  to  realize  distinct  functions 
for  the  various  MMPs.  These  findings  bolster  our  view  that  the  original  proposal  by  us, 
that  selective  inhibition  of  these  enzymes  is  possible,  is  indeed  valid. 
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The  inhibitors  that  we  set  out  to  make  were  expected  to  bind  to  the  gelatinase  active 
site  and  allow  coordination  of  their  halide  moiety  with  the  active-site  catalytic  zinc  ion  of  the 
enzyme.  This  interaction  would  activate  the  halide  for  departure.  A  glutamate  is  near  the 
zinc  ion  in  the  active  site,  which  would  be  expected  to  displace  the  halide.  This  strategy 
was  implemented  successfully  for  the  model  zinc-dependent  protease,  carboxypeptidase  A 
recently,4,5  and  is  depicted  in  the  picture  below.  Modeling  with  the  coordinates  of 
gelatinases  has  guided  us  in  new  directions  for  design  of  such  inhibitors,  as  described 
below. 


protein  protein 


Inactivated  enzym 


i 


1.  Synthesis  of  iodine  containing  peptide  mimics  as  inactivators  for  MMPs 

We  had  to  make  molecules  that  were  designed  for  gelatinases  specifically.  And  the 
interactions  had  to  be  of  high  affinity.  Molecular  modeling  indicates  that  compounds  1  and 
2  should  bind  to  the  active  site  of  gelatinases.  Furthermore,  they  would  fulfill  the 
coordination  chemistry  and  the  predisposition  to  modification  by  the  glutamate.  Compound 
1  was  synthesized  according  to  Scheme  1. 


CbzHN 


CbzHN 


NHBn 


Ptl^ 

2 


Compound  1  inhibited  gelatinases  according  to  the  profile  for  mechanism-based  inhibition. 
However,  the  affinity  of  the  enzymes  was  not  very  high  for  this  compound  (millimolar 
range). 
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Scheme  1 

The  attempted  synthesis  of  compound  2  (scheme  2)  started  with  the  Evans  oxazolidinone 
3,  and  involved  a  stereospecific  aldol  reaction.  Subsequent  cleavage  of  the  auxiliary  with 
lithium  hydroxide  followed  by  a  DCC  coupling  reaction  with  benzylamine  led  to  the 
precursor  4  in  high  yield.  The  conversion  of  the  secondary  alcohol  of  4  into  an  iodo 
group  was  attempted  using  numerous  methods  (over  a  dozen  methds  were  tried),  none  of 
which  were  successful.  The  methods  attempted  included  the  treatment  of  the  alcohol  with 
:PPh3,  imidazole,  I2;  P2I4,  CS2;  N-methyI-N,N’-dicyclohexylcarbodiimidium  iodide;  1- 
methyl-2-fluoropyridinium  p-toluenesulfonate,  Nal;  3-ethyl-2-fluorobenzothiazolium 
tetrafluoroborate,  Nal;  Tf20,  pyridine,  Bu4NI;  MsCl,  pyridine,  Nal;  PPh3,  DEAD,  Znl2; 
PPh3,  DEAD,  Mel;  TMSCl/Nal,  CH3CN;  SOCl2,  Nal,  CH3CN;  DMF,  (COCl)2,  Nal. 
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2.  Synthesis  of  the  epoxide  5  and  the  hydroxyiodide  6. 

It  has  been  demonstrated  that  an  epoxide  group  can  be  a  potent  and  effective  candidate  for 
inactivation  of  carboxypeptidase  A,  a  model  zinc  protease.6  The  principle  is  the  same  as  the 
halide  strategy.  The  epoxide  oxygen  would  coordinate  to  the  active-site  zinc,  and  it  is 
displaced  by  the  incoming  nucleophile.  For  targeting  of  gelatinase,  epoxide  of  5  would 
coordinate  to  the  active-site  zinc  ion,  thus  increasing  the  reactivity  of  the  oxirane  towards 
attack  by  the  glutamate.  Subsequent  attack  of  the  activated  oxirane  by  the  active-site 
glutamic  acid  would  result  in  the  covalent  bonding  of  5  to  the  enzyme,  resulting  in  the  loss 
of  enzyme  activity.  The  reason  that  6  was  chosen  as  a  target  molecule  was  because  we 
expected  that  both  the  hydroxyl  and  iodo  groups  would  coordinate  to  the  active-site  zinc 
ion.  This  chelation  may  increase  the  affinity  for  the  site.  Compound  5  was  prepared 
(scheme  3)  but  all  attempts  to  date  to  convert  the  epoxide  into  the  hydroxy  iodo  6  failed. 
Furthermore,  enzymatic  testing  revealed  that  compound  5  did  not  inactivate  the  enzyme. 
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BuLi,  -  78  °C 


H2,  Pd/C 


NHCH3 


Scheme  3 


Similarly,  attempts  where  made  to  synthesize  the  peptide  based  epoxide  7  (scheme  4). 
At  first  it  seemed  as  though  the  reaction  to  form  the  epoxide  7  had  been  successful,  but 
later  attempts  to  assign  the  'H  NMR  spectrum  for  this  product  proved  impossible.  The 
'H  NMR  spectrum  did  appear  to  show  a  1:1  mixture  of  diastereoisomers,  as  would  be 
expected  for  a  reaction  of  this  type,  but  not  all  the  chemical  shift  values  agree  with  the 
proposed  structure.  The  reaction  to  convert  the  desired  epoxide  7  into  a  halohydrin  was 
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attempted  before  detailed  investigation  of  the  'H  NMR  spectrum  of  7  was  carried  out. 
Unable  to  prepare  either  the  epoxide  or  an  iodohydrin  via  this  route,  an  alternative 
synthesis  was  sought  (scheme  5).  The  idea  behind  this  route  was  to  prepare  the  epoxide 
or  the  iodohydrin-containing  fragment  as  an  individual  compound  and  then  attach  it  to  the 
peptide  component  using  a  DCC  coupling  reaction.  Although  we  successfully  prepared 
both  the  desired  epoxide  8  and  iodohydrin  9,  all  attempts  to  remove  the  benzyl  group 
failed.  When  the  epoxide  was  treated  with  hydrogen  in  the  presence  of  10%  Pd-C,  the 
lactone  10  was  formed  (Scheme  6),  and  when  attempts  were  made  to  remove  the  benzyl 
group  from  the  iodohydrin,  a  complex  mixture  of  compounds  was  obtained. 


NHCbz 

rr0H 
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Scheme  5 


Scheme  6 


3.  Sysnthesis  of  nonpeptide  epoxide  and  hydroxyiodide  compounds  as 
Gelatinase  inactivators. 

It  was  found  that  the  peptide  mimic  epoxide  5  was  unstable  during  purification.  It  is  for 
this  reason  that  nonpeptide  epoxide  inactivators  (11,12,15,16)  and  hydroxyiodide 
(13,14,17,18)  were  designed  and  the  syntheses  are  currently  under  investigation. 
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Scheme  8 


Key  Research  Accomplishments 

*  Synthesized  a  number  of  derivatives  leading  to  proposed  target  molecules. 

*  Two  of  the  potential  gelatinase  inhibitors  were  successfully  synthesized.  One 
showed  mechanism-based  inhibition  of  the  enzyme,  though  affinity  was  low. 

*  New  target  molecules  were  designed  using  computational  modeling. 

Reportable  Outcomes 

*  Two  papers  have  resulted  from  our  modeling  analyses.  They  are  provided  as 
appendices. 

*  The  database  for  sequence  analyses  for  MMPs,  and  the  coordinates  for  their 
models,  are  provided  to  the  public  in  our  group  Web  site. 

Conclusions 

We  have  synthesized  several  molecules,  in  multiple  steps  each,  as  potential 
inactivators  of  gelatinases.  These  molecules  establish  some  of  the  required  elements  in 
binding  to  the  active  site,  and  one  show  time-dependence  for  loss  of  activity,  which  is 
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required  for  the  inactivators  of  our  design.  Our  recent  results  point  us  in  the  direction  of 
improving  our  design  paradigms,  which  should  lead  to  molecules  of  high  potency  for 
inhibition  of  gelatinases. 
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Appendix  III 


Matrix  metalloproteinases:  structures,  evolution,  and 
diversification 


IRINA  MASSOVA,*'1  LAKSHMI  P.  KOTRA,*  RAFAEL  FRIDMAN  f 
AND  SHAHRIAR  MOBASHERY*’* 

♦Department  of  Chemistry,  and  ^Department  of  Pathology  and  Karmanos  Cancer  Institute 
State  University,  Detroit,  Michigan  48202-3489,  USA 


ABSTRACT  yA  comprehensive  sequence  alignment 
of  64  members  of  the  family  of  matrix  metallopro¬ 
teinases  (MMPs)  for  the  entire  sequences,  and  sub¬ 
sequently  the  catalytic  and  the  hemopexin-like  do¬ 
mains,  have  been  performed.  The  64  MMPs  were 
selected  from  plants,  invertebrates,  and  vertebrates. 
The  analyses  disclosed  that  as  many  as  23  distinct  sub¬ 
families  of  these  proteins  are  known  to  exist.  Infor¬ 
mation  from  the  sequence  alignments  was  correlated 
with  structures,  both  crystallographic  as  well  as  com¬ 
putational,  of  the  catalytic  domains  for  the  23  rep¬ 
resentative  members  of  the  MMP  family.  A  survey  of 
the  metal  binding  sites  and  two  loops  containing  vari¬ 
able  sequences  of  amino  acids,  which  are  important 
for  substrate  interactions,  are  discussed.  The  collec¬ 
tive  data  support  the  proposal  that  the  assembly  of 
the  domains  into  multidomain  enzymes  was  likely  to 
be  an  early  evolutionary  event.  This  was  followed  by 
diversification,  perhaps  in  parallel  among  the  MMPs, 
in  a  subsequent  evolutionary  time  scale.  Analysis  in¬ 
dicates  that  a  retrograde  structure  simplification  may 
have  accounted  for  the  evolution  of  MMPs  with  sim¬ 
ple  domain  constituents,  such  as  matrilysin,  from  the 
larger  and  more  elaborate  enzymes. — Massova,  L, 
Kotra,  L.  P.,  Fridman,  R.,  Mobashery,  S.  Matrix  me¬ 
talloproteinases:  structures,  evolution,  and  diversifi¬ 
cation.  FASEB J.  12,  1075-1095  (1998) 

Key  Words:  extracellular  matrix  •  MMP  •  hemopexin  •  tissue 
inhibitor  of  matrix  metalloproteinase 


BACKGROUND 

The  interactions  of  cells  with  the  extracellular  matrix 
(ECM)3  are  critical  for  the  normal  development  and 
function  of  the  organism.  Modulation  of  cell— matrix 
interactions  occurs  through  the  action  of  unique  pro¬ 
teolytic  systems  responsible  for  hydrolysis  of  a  variety 
of  ECM  components.  By  regulating  the  integrity  and 
composition  of  the  ECM  structure,  these  enzyme  sys¬ 
tems  play  a  pivotal  role  in  the  control  of  signals  elic¬ 
ited  by  matrix  molecules,  which  regulate  cell  prolif¬ 
eration,  differentiation,  and  cell  death.  The  turnover 
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and  remodeling  of  ECM  must  be  highly  regulated 
since  uncontrolled  proteolysis  contributes  to  abnor¬ 
mal  development  and  to  the  generation  of  many 
pathological  conditions  characterized  by  either  ex¬ 
cessive  degradation  or  a  lack  of  degradation  of  ECM 
components.  Matrix  metalloproteinases  (MMPs)  are 
a  major  group  of  enzymes  that  regulate  cell-matrix 
composition.  The  MMPs  are  zinc-dependent  endo- 
peptidases  known  for  their  ability  to  cleave  one  or 
several  ECM  constituents,  as  well  as  nonmatrix  pro¬ 
teins.  They  comprise  a  large  family  of  proteases  that 
share  common  structural  and  functional  elements 
and  are  products  of  different  genes.  Ample  evidence 
exists  on  the  role  of  MMPs  in  normal  and  patholog¬ 
ical  processes,  including  embryogenesis,  wound  heal¬ 
ing,  inflammation,  arthritis,  and  cancer.  The  associ¬ 
ation  of  MMPs  with  cancer  metastasis  has  raised 
considerable  interest  because  they  represent  an  at¬ 
tractive  target  for  development  of  novel  antimetas¬ 
tatic  drugs  aimed  at  inhibiting  MMP  activity.  There¬ 
fore,  understanding  the  structure  and  function  of 
these  key  enzymes  has  significant  implications  for 
cancer  therapy  (1-5). 

Most  members  of  the  MMP  family  are  organized 
into  three  basic,  distinctive,  and  well-conserved  do¬ 
mains  based  on  structural  considerations:  an  amino- 
terminal  propeptide;  a  catalytic  domain;  and  a  hemo¬ 
pexin-like  domain  at  the  carboxy-terminal  (Fig.  1). 
The  propeptide  consists  of  approximately  80-90 
amino  acids  containing  a  cysteine  residue,  which  in¬ 
teracts  with  the  catalytic  zinc  atom  via  its  side 
chain  thiol  group.  A  highly  conserved  sequence 
(.  .  .PRCGXPD.  .  .)  is  present  in  the  propeptide. 
Removal  of  the  propeptide  by  proteolysis  results  in 
zymogen  activation,  as  all  members  of  the  MMP  fam¬ 
ily  are  produced  in  a  latent  form.  The  catalytic  do¬ 
main  contains  two  zinc  ions  and  at  least  one  calcium 
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Figure  1.  A)  Basic  domain  structures  of  MMPs.  The  images  for  the  propeptide  region  and  the  catalytic  and  homopexin-like 
domains  shown  here  are  from  crystallographic  sources.  The  propeptide  region  is  taken  from  the  X-ray  structure  for  stomelysin 
(PDB  code:  lslm)  and  the  remaining  portions  of  the  structure  are  taken  from  the  X-ray  structure  of  the  full-length  collagenase 
(PDB  code:  lfbl).  Catalytic  zinc  is  shown  as  an  orange  sphere;  calcium  ions  in  the  catalytic  domain  and  the  hemopexin-like 
domain  are  shown  in  cyan.  The  propeptide  region  is  shown  by  the  green  ribbon,  catalytic  domain  as  a  surface  in  pink,  hinge 
region  as  a  surface  in  white,  and  the  hemopexin-like  domain  is  represented  by  the  ribbon  drawing  in  yellow.  B)  Schematics  of 
the  domain  structures  of  the  23  representative  MMPs.  Catalytic  domain  (represented  by  green)  has  an  insertion  of  gelatin 
binding  domain  in  MMP -2  and  MMP-9.  In  all  other  MMPs,  the  catalytic  domain  is  a  continuous  entity. 
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ion  coordinated  to  various  residues.  One  of  the  two 
zinc  ions  is  present  in  the  active  site  and  is  involved 
in  the  catalytic  processes  of  the  MMPs.  The  second 
zinc  ion  (also  known  as  structural  zinc)  and  the  cal¬ 
cium  ion  are  present  in  the  catalytic  domain  approx¬ 
imately  12  A  away  from  the  catalytic  zinc.  The  cata¬ 
lytic  zinc  ion  is  essential  for  the  proteolytic  activity  of 
MMPs;  the  three  histidine  residues  that  coordinate 
with  the  catalytic  zinc  are  conserved  among  all  the 
MMPs.  Little  is  known  about  the  roles  of  the  second 
zinc  ion  and  the  calcium  ion  within  the  catalytic  do¬ 
main,  but  the  MMPs  are  shown  to  possess  high  affin¬ 
ities  for  structural  zinc  and  calcium  ions  (6,  7).  The 
hemopexin-like  domain  of  MMPs  is  highly  conserved 
and  shows  sequence  similarity  to  the  plasma  protein, 
hemopexin.  The  hemopexin-like  domain  has  been 
shown  to  play  a  functional  role  in  substrate  binding 
and/ or  in  interactions  with  the  tissue  inhibitors  of 
metalloproteinases  (TIMPs),  a  family  of  specific 
MMP  protein  inhibitors  (8,  9).  In  addition  to  these 
basic  domains,  the  family  of  MMPs  evolved  into  dif¬ 
ferent  subgroups  by  incorporating  and/or  deleting 
structural  and  functional  domains.  For  example, 
MMP-2  and  MMP-9,  also  known  as  gelatinases,  incor¬ 
porated  the  three  repeats  homologous  to  the  type-II 
module  of  fibronectin  into  the  catalytic  domain  that 
has  been  shown  to  be  involved  in  binding  to  dena¬ 
tured  collagen  or  gelatin  (10).  This  domain,  known 
as  the  gelatin  binding  domain  or  fibronectin  type-II- 
like  domain,  is  unique  to  the  gelatinases,  and  so  these 
enzymes  are  regarded  as  a  separate  subgroup  among 
members  of  the  MMP  family.  Incorporation  of  a  hy¬ 
drophobic  stretch  of  approximately  25  amino  acids, 
representing  a  putative  transmembrane  domain  at 
the  carboxy  terminus  and  recognition  motif  (RXKR) 
for  furin-like  convertases  at  the  end  of  the  propeptide 
domain,  is  a  characteristic  of  the  membrane-type 
MMPs  (MT-MMPs)  (11,  12)  except  MT4-MMP  (vide 
infra).  MMP-11  also  contains  this  furin  recognition 
motif  and,  similar  to  the  MT-MMPs,  it  is  processed 
into  the  active  form  intracellularly  (13).  Additional 
insertion  to  the  three  basic  MMP  domains  also  in¬ 
cludes  a  proline-rich  54  amino  acid  insertion  in 
MMP-9  with  sequence  similarity  to  the  chain  of 
collagen  V  (14) .  Finally,  MMP-7  lacks  the  hemopexin- 
like  domain  and  represents  the  smallest  member  of 
the  MMP  family. 

The  catalytic  activity  of  the  MMPs  is  regulated  at 
multiple  levels  including  transcription,  secretion,  ac¬ 
tivation,  and  inhibition.  The  last  is  accomplished  by 
members  of  the  TIMP  family,  which  presently  in¬ 
cludes  four  proteins:  TIMP-1,  TIMP-2,  TIMP-3,  and 
TIMP-4  (8,  15).  Binding  of  the  TIMPs  to  the  catalytic 
domain  results  in  efficient  inhibition  of  enzymatic  ac¬ 
tivity  of  MMPs.  In  the  case  of  gelatinases,  the  TIMPs 
have  been  shown  to  bind  to  the  zymogen  forms  of 
the  enzymes.  This  interaction  has  been  suggested  to 
provide  an  extra  level  of  regulation  by  potentially  pre- 
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venting  activation  (15,  16).  However,  it  has  recently 
been  shown  that  TIMP-2  forms  a  trimolecular  com¬ 
plex  on  the  surface  of  the  cell  with  MT1-MMP  and 
proMMP-2,  and  regulates  the  formation  and  levels  of 
..concentration  of  mature  MMP-2  (17).  The  crystal 
structure  of  the  catalytic  domain  of  MMP-3  in  com¬ 
plex  with  TIMP-1  has  been  solved  and  shows  that  Cys1 
of  the  inhibitor  interacts  with  the  catalytic  zinc  ion  of 
the  MMP  through  the  oc-amino  and  its  carbonyl 
group,  whereas  the  Thr2  side  chain  extends  into  the 
S/  specificity  pocket  of  the  enzyme  (9) .  A  critical  step 
in  the  control  of  MMP  activity  is  regulated  by  the  gen¬ 
eration  of  active  enzyme  species  with  proteolytic  ac¬ 
tivity.  The  process  of  activation  involves  sequential 
cleavage  of  the  propeptide,  which  disrupts  coordi¬ 
nation  between  the  cysteine  thiol  in  the  propeptide 
region  with  the  zinc  atom  in  the  catalytic  domain. 
This  process  is  proteolytic  and  may  involve  other 
MMPs  acting  in  a  cascade  of  zymogen  activation. 

MMPs  belong  to  the  superfamily  of  zinc-peptidases, 
and  the  evolutionary  relationship  of  this  superfamily 
has  been  reviewed  (18,  19).  Sang  and  Douglas  (20) 
analyzed  30  MMP  sequences  from  various  sources  by 
multiple-sequence  alignment,  but  their  study  was  lim¬ 
ited  to  analysis  of  the  primary  sequences.  A  total  of  66 
MMPs  have  been  sequenced  to  date,  of  which  17  are 
from  humans,  including  the  recently  discovered  hu¬ 
man  enamelysin  (MMP-20)  and  a  functional  enzyme 
encoded  by  the  mmp20  gene  (GenBank  accession 
number  AJ003147,  maps  to  chromosome  16)  in  the 
familial  Mediterranean  fevergene  in  humans  (21,  22) . 
These  human  enzymes  have  counterparts  in  other  ver¬ 
tebrates.  MMPs  have  even  been  identified  in  inverte¬ 
brates  (23-25)  and  three  have  recently  been  se¬ 
quenced  from  plant  sources  (mouse  ear  cress  MMPs, 
Table  1).  MMPs  are  probably  more  ancient  than  is 
currently  realized.  Their  origin  might  actually  be 
traceable  back  to  bacteria  in  that  a  certain  amino  acid 
sequence  for  Bacteroides  fragilis  metalloproteinase 
toxin-2  (GenBank  accession  number  U90931)  has 
59%  sequence  identity  to  the  continuous  27  amino 
acid  stretch  in  human  MMP-1,  which  includes  the  cat¬ 
alytic  zinc  binding  domain  and  the  4 methionine  turn* 
jCthis  is  a  strictly  conserved  region  with  a  methionihe 
in  the  catalytic  domain  of  MMPs  responsible  for  the 
structural  integrity  of  the  zinc  binding  site).  A  salient 
question  is  why  such  multiplicity  of  these  enzymes  is 
seen  in  nature.  We  wish  to  add  to  this  question  an 
additional  inquiry:  What  makes  them  different,  and 
how  does  the  difference  in  amino  acid  sequences  give 
rise  to  structural  elements  that,  in  turn,  would  render 
a  given  MMP  a  distinct  enzyme?  We  have  compared 
amino  acid  sequences  from  64  MMPs  from  various  or¬ 
ganisms,  vertebrates,  invertebrates,  and  plants  to  ad¬ 
dress  these  questions.  Using  the  available  crystal  struc¬ 
tures  for  four  MMPs  (26-29),  we  have  modeled  the 
3-dimensional  structures  of  several  representative 
members  of  the  remaining  MMPs  to  gain  insight  into 
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TABLE  1.  Sources  of  various  MMPs  used  in  multiple-sequence  alignment 


Gene,  organism 

Common 

name 

Accession  # 

Database 

MMP-2  (gelatinase  A)  _ 

Mmp2,  Mus  mus cuius 

Mouse 

P33434 

Swiss-Prot 

Mmp2,  Rattus  norvegicus 

Rat 

P33436 

Swiss-Prot 

MMP2,  Homo  sapiens 

Human 

P08253 

Swiss-Prot 

Mmp2,  Oryctolagus  cuniculus 

Rabbit 

P50757 

Swiss-Prot 

Mmp2,  GaUus  gallus 

Chicken 

090611 

Swiss-Prot 

MMP-9  (gelatinase  B) 

Mmp9  or  Clgib,  Mus  mus  cuius 

Mouse 

P41245 

Swiss-Pro  t 

Mmp9,  Rattus  noruegicus 

Rat 

P50282 

Swiss-Prot 

MMP9,  Homo  sapiens 

Human 

P14780 

Swiss-Prot 

Mmp9,  Oryctolagus  cuniculus 

Rabbit 

P41246 

Swiss-Prot 

Mmp9,  Bos  taurus 

Bovin 

P52176 

Swiss-Prot 

Mmp9,  Synops  pyrrho gaster 

Newt 

Q98856 

TREMBL 

MMP-1 2  (macrophage  metalloelastase) 

Rattus  norvegicus 

Rat 

X98517 

GenBank 

Oryctolagus  cuniculus 

Rabbit 

088652 

GenBank 

MMP12  or  HME,  Homo  sapiens 

Human 

P39900 

Swiss-Prot 

Mmpl2  or  Mme,  Mus  musculus 

Mouse 

P34960 

Swiss-Prot 

MMP-13  (collagenase-3) 

Mmpl3 ,  Mus  musculus 

Mouse 

P33435 

Swiss-Prot 

Mmpl3,  Rattus  norvegicus 

Rat 

P23097 

Swiss-Prot 

MMP13,  Homo  sapiens 

Human 

P45452 

Swiss-Prot 

gene  A ,  Xenopus  laevis 

Frog 

U41824 

GenBank 

Mmpl  3,  Xenopus  laevis 

Frog 

Q10835 

Swiss-Prot 

Cynops  pyrrho  gaster 

Newt 

D82055 

GenBank 

CoIIagenase-4 

Xenopus  laevis 

Frog 

L76275 

GenBank 

MMP-3  (stromelysin-1) 

Mmp3,  Mus  musculus 

Mouse 

P28862 

Swiss-Prot 

Mmp3,  Rattus  norvegicus 

Rat 

P03957 

Swiss-Prot 

MMP3  or  STMY1,  Homo  sapiens 

Human 

P08254 

Swiss-Prot 

Mmp3,  Oryctolagus  cuniculus 

Rabbit 

P28863 

Swiss-Prot 

Equus  caballus 

Horse 

U62529 

GenBank 

MMP-10  (stromelysin-2) 

MMP10  or  STMY2 ,  Homo  sapiens 

Human 

P09238 

Swiss-Prot 

MmplO,  Rattus  norvegicus 

Rat 

P07152 

Swiss-Prot 

MMP-1  (interstitial  collagenase) 

Mmpl  or  Clg,  Bos  taurus 

Bovin 

P28053 

Swiss-Prot 

Mmpl,  Sus  scrofa 

Pig 

P21692 

Swiss-Prot 

MMP1  or  CLG,  Homo  sapiens 

Human 

P03956 

Swiss-Prot 

Mmpl,  Oryctolagus  cuniculus 

Rabbit 

P13943 

Swiss-Prot 

Rana  catesbeiana 

Bull  frog 

011133 

Swiss-Prot 

MMP-8  (neutrophil  collagenase) 

MMP8  or  CLG1,  Homo  sapiens 

CMMP 

Gallus  gallus 

Human 

Chicken 

P22894 

Swiss-Prot 

Enamelysin 

Bos  taurus 

Bovin 

AF009922 

GenBank 

Sus  scrofa 

PiR 

U54825 

GenBank 

MMP-7  (matrilysin) 

Mmp7,  Mus  musculus 

Mouse 

Q10738 

Swiss-Prot 

Mmpl,  Rattus  norvegicus 

Rat 

P50280 

Swiss-Prot 

Mmp7,  Felis  silvestris 

Cat 

P55032 

Swiss-Prot 

MMP7,  Homo  sapiens 

Human 

P09237 

Swiss-Prot 

MMP-1 1  (stromelysin-3) 

MMP11  or  STMY3,  Homo  sapiens 

Human 

P24347 

Swiss-Prot 

Mmpll,  Mus  musculus 

Mouse 

Q02853 

Swiss-Prot 

Xenopus  laevis 

Frog 

Q11005 

Swiss-Prot 

Rattus  norvegicus 

Rat 

P97576 

TRF.MBT 

MMP-1 6  (MT3-MMP) 

MMP16  or  MMPX2,  Homo  sapiens 

Human 

P51512 

Swiss-Prot 

MT3-MMP,  GaUus  gallus 

Chicken 

U66463 

(continued 

GenBank 
on  next  page) 
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TABLE  1.  ( continued ) 


Gene,  organism 

Common 

name 

Accession  # 

Database 

MMP-14  (MT1-MMP) 

MMP  14  or  MMP-X1 ,  Homo  sapiens 

Human 

P50281 

Swiss-Prot 

Mmpl4  or  MT-MMP,  Raltus  norvegicus 

Rat 

Q10739 

Swiss-Prot 

Mmpl4  or  MT-MMP \  Mus  mus cuius 

Mouse 

P53690 

Swiss-Prot 

Mmpl4,  Oryctolagus  cuniculus 

Rabbit 

095220 

Swiss-Prot 

MMP-1 5  (MT2-MMP) 

MMP 15,  Homo  sapiens 

Human 

P51511 

Swiss-Prot 

MMP-1 7  (MT4-MMP) 

MMP1 7,  Homo  sapiens 

Human 

X89576 

GenBank 

MMP-19 

*  MMP 19,  Homo  sapiens 

Human 

X92521 

GenBank 

Plant  metalloproteases 

Glycine  max 

soybean 

U63725 

GenBank 

Arabidopsis  thaliana 

Mouse  ear 

cress 

004529 

TREMBL 

Arabidopsis  thaliana 

Mouse  ear 

cress 

E327511 

GenBank 

Envelysin 

Hemicentrotus  pulcherrimus 

Sea  urchin 

AB000719 

GenBank 

Paracentrotus  lividus 

Sea  urchin 

P22757 

Swiss-Prot 

Zinc  protease 

T21D1 1. 1 ,  Caenorhabditis  elegans 

Nematode 

U00038 

GenBank 

XMMP 

Xenopus  laevis 

Frog 

Others 

Cynops  pyrrhogaster 

Newt 

D82053 

GenBank 

Cynops  pyrrhogaster 

Newt 

D82054 

GenBank 

their  similarities  and  differences  (30,  31).  We  have  an¬ 
alyzed  the  entire  sequences,  the  catalytic  domains,  and 
hemopexin-like  domains  of  the  64  members  of  the 
MMP  family  in  terms  of  structures,  evolution,  and  in¬ 
teractions  with  substrates  and  inhibitors.  We  also  pro¬ 
vide  an  analysis  of  the  structural  zinc  binding  site  and 
the  calcium  binding  site.  These  are  the  first  compre¬ 
hensive  analyses  of  this  important  family  of  enzymes, 
and  provide  fundamental  information  on  evolution 
and  properties  of  MMPs. 

EXPERIMENTAL  PROCEDURES 

Amino  acid  sequences  of  MMPs  were  obtained  from 
the  GenBank,  TREMBL,  and  Swiss-Prot  data  banks. 
The  sources  for  all  sequences  are  given  in  Table  1.  A 
total  of  64  different  MMPs  were  used  for  this  analysis 
[human  enamelysin  and  the  mmp20  gene  product 
with  GenBank  accession  #  AJ003147  were  not  in¬ 
cluded  because  they  were  reported  after  the  comple¬ 
tion  of  our  analysis.  However,  we  conducted  a  sepa¬ 
rate  multiple-sequence  analysis  with  the  64  sequences 
and  the  two  new  MMP  sequences  after  preparing  this 
review.  This  analysis  showed  that  the  clustering  pat¬ 
tern  was  identical  to  that  reported  in  this  article.  Hu¬ 
man  enamelysin  grouped  with  other  enamelysins, 
and  the  mmp20  gene  product  showed  the  closest  ho¬ 
mology  to  MT4-MMP].  The  multiple-sequence  align- 
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ments  were  performed  using  the  program  PileUp 
from  the  Wisconsin  package  version  9.  Four  human 
MMPs  [fibroblast  (MMP-1,  lcgl)  (26)  and  neutrophil 
(MMP-8,  lmnc)  (27)  collagenases,  matrilysin  (MMP- 
7,  Immq)  (28),  and  stromelysin-1  (MMP-3,  lslm) 
(29)]  have  recently  been  crystklized  and  their  coor¬ 
dinates  are  available.  We  used  structural  information 
to  predict  the  3-dimensional  structures  for  the  ho¬ 
mologous  metalloproteinases  using  the  program 
COMPOSER  (Tripos  Associates,  Inc.,  St.  Louis,  Mo.). 

We  recently  reported  the  computational  3-dimen¬ 
sional  models  for  the  catalytic  domains  of  MMP-2  and 
MMP-9  (30)  and  also  predicted  the  folding  of  an  ad¬ 
ditional  17  representative  MMP  enzymes  for  which 
such  information  has  been  lacking  (31).  These  17 
proteins  are  human  MMP-10,  MMP-11,  MMP-12, 
MMP-1 3,  MT1-MMP  (MMP-1 4),  MT2-MMP  (MMP- 
15),  MT3-MMP  (MMP-1 6),  MT4-MMP  (MMP-1 7), 
MMP-19  (also  referred  to  as  MMP-18),  pig  enamely¬ 
sin,  sea  urchin  envelysin,  stromelysin-like  MMP  from 
newt,  collagenases  from  frog,  nematode  MMP, 
chicken  CMMP,  frog  XMMP,  and  MMP  from  mouse 
ear  cress. 


RESULTS  AND  DISCUSSION 

Multiple-sequence  analysis  for  the  64  MMPs  was  con¬ 
ducted  on  three  different  sets  of  data.  In  the  first  set, 
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the  complete  sequences,  including  the  signal  and 
propeptide  regions,  were  used  (Fig.  24).  A  simplified 
schematic  presentation  of  Fig.  2  A  is  shown  as  Fig.  2 B. 
The  entire  sequences  were  used  to  understand  the 


overall  evolutionary  pathways  for  diversification.  Evo¬ 
lution  occurs  via  separate  events  of  sequence  modi¬ 
fication  of  the  entire  gene  such  as  point  mutations, 
insertions,  deletions,  gene  splitting,  and  fusions.  Mu’ 
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Figure  2.  ( continued) 

tations  occur  at  random  regardless  of  the  function  of 
the  region  of  the  gene  or  the  domain  in  order  to 
increase  diversity  throughout  the  genes  in  question, 
which  is  why  this  sequence  analysis  on  the  entire  pro¬ 
tein  is  informative.  The  second  and  third  sets  for  mul¬ 
tiple-sequence  analyses  were  carried  out  on  sequence 
stretches  corresponding  to  the  catalytic  and  hemo- 
pexin-like  domains  of  the  MMPs  (Fig.  3  and  Fig.  4, 
respectively) .  These  two  sets  were  chosen  to  study  the 
different  evolutionary  pressures  on  these  specific  do¬ 
mains  en  route  to  diversification.  Mutations  that  im¬ 
pair  catalytic  ability  would  not  be  selected.  In  con¬ 
trast,  substrate  specificities  for  the  majority  of  the 
MMPs  are  believed  to  have  been  determined  by  the 
interactions  of  protein  substrates  and  TIMPs  with  ei¬ 
ther  the  hemopexin-like  domain  or  the  gelatin  bind¬ 
ing  domain  (in  the  case  of  gelatinases)  (32,  33). 
Hence,  evolution  of  these  domains  may  reflect  evo¬ 
lution  of  substrate  specificities  and  interactions  with 
TIMPs.  Therefore,  comparison  of  the  data  for  anal¬ 
yses  of  the  catalytic  domain  and  hemopexin-like  do¬ 
main  with  those  of  the  complete  sequences  would 
provide  information  on  whether  the  assembly  of  the 
genes  encoding  these  domains  occurred  at  a  later 
time  than  the  differentiation  of  MMPs  into  separate 
subfamilies. 

Analysis  of  the  entire  sequences  and  the  separate 
analyses  of  the  catalytic  and  hemopexin-like  domains 
are  quite  revealing.  Analysis  of  the  entire  sequences 
and  of  the  catalytic  domains  gave  rise  to  22  distinct 
subfamilies  of  MMPs.  The  number  for  each  cluster  is 
given  in  brackets  and  is  arbitrary.  We  tried  to  corre¬ 
late  the  number  for  the  clusters  with  those  for  the 
given  MMP  when  possible.  For  example,  cluster  1  is 


given  to  MMP-1.  Each  analysis  produced  clusters  of 
enzymes  that  were  individually  comprised  of  MMPs 
of  a  given  type.  The  only  exceptions  were  MMP-3  and 
MMP-10  (stromelysins-1  and -2,  respectively) ,  which 
clustered  into  one  subfamily -(Figs.  2  and-3)  .  How¬ 
ever,  these  stromelysins  formed  two  independent 
groups  in  the  alignment  of  the  hemopexin-like  do¬ 
mains,  giving  rise  to  a  total  of  23  distinct  subfamilies 
of  MMPs  for  this  analysis.  Figure  5  provides  the 
alignment  for  the  amino  acid  sequences  of  the  23 
representative  enzymes  using  the  PileUp  program. 
This  alignment  was  made  consistent  with  the  align¬ 
ments  found  by  the  program  COMPOSER,  which 
takes  into  account  the  predicted  folded  structures 
of  the  proteins.  COMPOSER  assigns  different  con¬ 
tributions  for  the  ‘structurally  conserved  regions*, 
such  as  elements  of  well-defined  secondary  struc¬ 
ture,  for  example,  P-strands  and  a-helices,  as  op¬ 
posed  to  loops,  which  are  considered  to  be  variable 
areas.  In  contrast,  PileUp  does  not  differentiate  be¬ 
tween  secondary  structure  elements  as  does  COM¬ 
POSER,  but  rather  performs  the  alignment  based 
solely  on  sequence  homology  of  amino  acids.  There¬ 
fore,  sequence  alignments  found  by  COMPOSER 
and  PileUp  are  based  on  different  principles,  yet 
they  have  the  potential  to  complement  each  other. 
We  have  edited  the  result  of  the  PileUp  alignment 
from  the  insight  gained  by  the  COMPOSER  analysis, 
since  the  3-dimensional  structural  information 
would  enhance  reliability  of  this  type  of  analysis  con¬ 
siderably.  Major  protein  structural  blocks,  ^-strands 
and  a-helices,  have  a  greater  tendency  to  be  con¬ 
served  during  the  evolutionary  process  than  do 
mere  sequences  of  amino  acids. 

Multiple-sequence  analysis  of  the  complete 
sequences  of  MMPs 

The  22  major  subfamilies  of  MMPs,  as  discerned  from 
the  multiple-sequence  analysis  and  the  dendrogram 
for  the  64  MMPs,  are  shown  in  Fig.  2A,  with  a  sim¬ 
plified  version  depicted  in  Fig.  2 B.  The  chromosome 
locations  for  human  MMPs  are  shown  (in  circles)  in 
Fig.  2 B  when  available.  Members  of  each  subfamily 
generally  display  similar  substrate  profiles  in  different 
organisms.  MMP-3  and  MMP-10  clustered  in  one 
group,  and  show  almost  identical  properties  and  sub¬ 
strate  profiles  (8).  Genes  of  the  more  closely  clus¬ 
tered  human  MMPs  all  seem  to  have  originated  from 
the  same  chromosome  11,  reflecting  the  evolutionary 
process  suggested  in  Fig.  2 B. 

It  has  been  proposed  that  the  origin  of  MMPs 
could  be  traced  to  before  the  emergence  of  verte¬ 
brates  from  invertebrates  (34,  35).  Recent  sequenc¬ 
ing  of  three  plant  MMPs  found  these  enzymes  to  be 
homologous  to  MMPs  from  vertebrate  and  inverte¬ 
brate  origins,  indicating  that  this  evolutionary  pro¬ 
cess  is  even  more  ancient  than  previously  appreciated 
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acids  in  the  catalytic  domains  for  64  MMPs. 


(Table  1)  (36).  Furthermore,  the  discovery  of  a  pep¬ 
tide  sequence  for  a  metal-containing  enzyme  from  B. 
fragilis  suggests  that  MMPs  may  be  more  ancient  yet 
(37) ,  given  that  bacteria  have  been  around  for  longer 
than  3.5  billion  years.  The  three  main  branches  of 


the  dendrogram  in  Fig.  2 A  give  rise  to  the  lines  that 
lead  to  enzymes  from  the  vertebrate  (depicted  as  the 
V  branch),  invertebrate  (depicted  as  the  ‘i’  branch), 
and  plant  (depicted  as  the  ‘p’  branch)  MMPs,  re¬ 
spectively.  The  only  exception  is  the  XMMP  from 
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Figure  4.  Dendrogram  for  the  multiple-sequence  analysis  of  the  hemopexin-like  domain  sequences  for  64  MMPs.  Numbers  in 
parentheses  represent  the  class. 


Xenopus  (38).  Plant  and  invertebrate  MMPs  show  a 
stronger  relationship  among  themselves  than  to  the 
vertebrate  MMPs  (Fig.  2 A).  Thus,  the  vertebrate  en¬ 
zymes  are  more  remotely  related  to  plant  and  inver¬ 
tebrate  MMPs.  Although  one  would  expect  that  the 
plant  MMPs  are  the  most  ancient,  and  therefore  the 
enzymes  least  related  to  all  other  MMPs,  our  analysis 
shows  that  the  frog  XMMP  is  the  enzyme  least  related 


to  all  other  MMPs,  itself  forming  a  separate  group. 
This  indicates  that  XMMP  either  represents  a  sepa¬ 
rate  yet  unidentified  group  of  MMPs  or  is  the  last 
extant  member  of  a  primordial  MMP.  The  first  pos¬ 
sibility  seems  to  be  more  likely  in  our  view,  because 
XMMP  possesses  a  hemopexin-like  domain  that  is  ab¬ 
sent  in  plant  MMPs  and  the  nematode  enzyme.  In¬ 
deed,  the  plant  and  nematode  MMPs  have  the  sim- 
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plest  domain  structures.  It  may  be  suggested  that  at 
some  point  during  evolution,  the  genes  encoding  the 
primordial  MMP  and  hemopexin-like  domain  joined 
together,  and  XMMP  probably  originated  after  this 
event.  Based  on  our  analysis  of  Fig.  2A,  we  see  that 
the  sequence  of  MMP-7  fits  well  into  the  alignment 
of  the  entire  sequences.  This  is  also  true  for  the  se¬ 
quence  of  the  catalytic  domains  (vide  infra),  indicat¬ 
ing  that  this  enzyme  has  not  existed  as  an  evolu¬ 
tionary  exception,  or  oddity,  by  not  having  the 
hemopexin-like  domain.  In  light  of  what  appears  to 
be  an  independent,  and  perhaps  parallel,  evolution 
for  MMP-7,  the  lack  of  hemopexin-like  domain  in  ma- 
trilysin  represents  a  deletion  of  this  domain  during 
evolution  (Fig.  1A).  ° 

One  can  see  four  major  subgroups  within  the  ver¬ 
tebrate  branch  (Fig.  2A).  One  is  formed  by  MMP-19 
itself  (some  refer  to  this  as  MMP-18),  which  is  least 
related  to  other  vertebrate  MMPs,  besides  XMMP. 
The  next  branch  is  that  of  MMP-1 1  (stromelysin-3) 
clustered  together  with  the  MT-MMPs.  MT4-MMP 
(MMP-1 7)  shares  less  similarity  to  other  members  of 
this  branch  of  membrane-type  enzymes,  and  the  rest 
of  this  subgroup  is  more  closely  related  to  MMP-1 1 
than  to  MT4-MMP.  Both  MMP-1 1  and  the  MT-MMPs 
possess  the  furin  recognition  site  and  can  be  activated 
by  furin-like  convertases  (11,  39).  The  third  branch 
is  comprised  exclusively  of  gelatinases,  whereas  the 
last  branch  is  made  up  of  all  remaining  vertebrate 
MMPs. 

Figures  2-4  reveal  22-23  distinct  MMPs  subfami¬ 
lies,  sequence  similarities  of  which  were  preserved 
among  various  organisms  during  evolution.  It  is  likely 
that  one  would  find  counterparts  to  each  MMP  in 
various  vertebrate  organisms.  The  entire  amino  acid 
sequence  alignment  of  the  23  representative  mem¬ 
bers  of  these  subfamilies  is  shown  in  Fig.  5.  Human 
enzymes  have  been  selected  in  all  cases  when  avail¬ 
able.  The  simplified  domain  structures  of  these  23 
MMPs  are  shown  in  the  schematic  of  Fig.  IB.  Consis¬ 
tent  with  earlier  knowledge,  the  analysis  depicted  in 
Fig.  5  and  the  alignment  produced  by  the  PileUp  pro¬ 
gram  revealed  that  all  MMPs,  except  that  from  nem¬ 
atode,  have  signal  sequences,  followed  by  the  pro¬ 
peptide  region  containing  the  characteristic  motif 
called  the  cysteine  switch’  (40).  The  common  pat¬ 
tern  for  this  cysteine  switch  is  [PSL]-[RT]-C-[GS]- 
[VNL]-[PASYE]-D,  where  boldface  letters  designate 
the  most  commonly  found  amino  acids.  In  addition, 


MMP-1 1  and  the  MT-MMPs  (except  MT4-MMP)  con¬ 
tain  a  RXKR  motif  in  the  propeptide  that  has  been 
postulated  to  represent  a  cleavage  site  for  furin-like 
enzymes  (11,  13,  39).  This  cleavage  pattern  also  ap¬ 
pears  in  nematode  MMP  (two  repeats)  and  in 
XMMP,  as  seen  in  the  sequence  alignment.  MT4- 
MMP  and  the  mmp20  gene  product  have  a  variation 
of  the  furin-like  recognition  sequence  as  RRRR  (in¬ 
stead  of  RXKR)  at  this  position. 

The  sequence  of  the  nematode  MMP  seems  to  be 
considerably  shorter  at  the  amino  terminus  (24), 
which  either  reflects  that  the  amino-terminal  portion 
of  the  enzyme  was  not  sequenced  entirely  or  that  this 
enzyme  has  a  totally  different  mechanism  for  activa¬ 
tion.  The  amino-terminal  portion  of  the  nematode 
MMP  that  precedes  the  catalytic  domain  has  only  48 
amino  acids,  compared  to  100-200  amino  acids  in 
other  MMPs.  The  propeptide  sequence  forms  a  cap 
over  the  active  site  of  this  MMP,  with  the  critical  cys¬ 
teine  residue  providing  the  fourth  coordination  to 
the  catalytic  zinc  ion,  as  seen  in  the  crystal  structure 
of  stromelysin-1  (41).  We  also  see  a  conserved  sig¬ 
nature  for  the  catalytic  zinc  ion  binding  site  of  MMPs, 
with  the  consensus  pattern  containing  a  so-called  me¬ 
thionine  turn  (40,  42),  [ VAIT] -[AG] - [ATV] -7/-E- 
[FLI V]  -G -H-  [ALMSV]  -  [LIM]  -G-  [LM]  -X-£T-[SITV]  - 
X(5)-[LAFIV]-M,  where  X  denotes  any  residue  and 
X(5)  means  there  are  five  residues  between  the  flank¬ 
ing  sites.  The  three  histidines  shown  in  italics  chelate 
the  catalytic  zinc  ion,  and  the  methionine  (also  de¬ 
picted  in  italics)  is  located  underneath  the  cavity 
formed  by  these  histidines,  providing  increased  hy- 
drophobicity  in  this  area  to  enhance  zinc  binding 
ability  of  histidines  (42) .  ° 

Multiple-sequence  analysis  of  regions  encoding  the 
catalytic  domain  of  MMPs 

Figure  3  shows  the  multiple-sequence  analysis  carried 
out  for  the  regions  encoding  exclusively  the  catalytic 
domains.  Again  one  sees  22  distinct  clusters,  with 
MMP-3  and  MMP-10  in  one  group.  We  no  longer  see 
three  clearly  delineated  (separate)  large  groups  for 
invertebrate,  vertebrate,  and  plant  MMPs,  although 
there  appears  to  be  some  tendency  for  MMPs  from 
these  sources  to  group  together.  The  catalytic  do¬ 
mains  of  the  MT-MMPs  do  not  usually  appear  to  be 
closely  related  to  one  another  (for  example,  position 
of  MT4-MMP  in  Fig.  3).  An  unexpected  finding  was 
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ftipml  sequence 
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MMP1  human  - -  - 

MMP2  human  - - - - MEAL 

MMP3  human  - - - - - -  - - - 

MMP7  human  -  -  - 

MMP8  human  -  -  - 

MMP9  human  -  - - - - - - ^ 

MMPIO  human  - -  - 

MMP1 1  human  - -  - 

MMP12  human  - - - -  - 

MMP13  human  -  -  - 

MTI-MMP  human - *.«-« — _ _ _  _ 

MT2-MMP  human  - - - -  - M^cinpc 

MT3-MMP  human - - - 

MT4-MMP  human  — - -  -  - 

MMP 19  human  - - -  - 

Collagenase-4  frog  -  -  - 

MMP  newt  - - -  - 

CMMP  chicken  -  -  - - - - 

Enamelysin  pig  -  -  - 

Envelysin  MANSGLILLVMFMIHVTTVH  NVPLPSTAPSIITQLSDITT  SIXEEDAFGLTTPTTGLLTP 

MMP  nematode -  -  - 

XMMP  frog - 

MMP  cress  - - MRFCVFGFL  SLF - LIVSPASAWFFPNS 


- MHSFPPL-LLLLFWGW 

MARGALTGPLRALCLLGCLL 
— * - MKSLPI LLLLCVAV 

- mrltvlcavcl 

- MFSLKTLPFLLLLHVQI 

SLWQPLV — LVLLVLGCCFA 

- MMHLAFLVLLCLPV 

- MAPAAWLRSAAAR 

- MKFLLILL - L 

- MHPGVLAAFLFLSWTH 

- M3PAPRPSR 

APGRPGWTGSLLGDREEAAR 
- MILLTFSTGRRLDF 


- MNSLLLKLLLCVAI 

- MKILSL-LLLCAAG 

MERSVRMKNVLLLLLTYAAV 

- MKVLPASGLAVLLVTAL 

VSENDSDDDGD - DITTI 


- MPSIKLLVWCCLCV 

TAVPPSLRNTTRVFWD - 


_ ^  ^ _ _ _ Propeptide  sequence 

MMP1  human  SHSFP-ATL - ETQEQ — - DVDL - VQKY  LEKY - 

MMP2  human  SHAAAAPSPIIKFPGDVAPK  — TDKEL - AVQY  LNTF - 

MMP3  human  CSAYPLDGA - ARGED —  — TSMNL - VQKY  LENY - 

MMP7  human  LPGSLALPL - PQEAGGM  SELQWEQ - AQDY  LKRF - 

MMP8  human  SKAFP - V - SSKEK — - NTKT - VQDY  LEKF - 

MMP9  human  APRQRQSTL-VLFPGDLRT-  NLTDRQL - AEEY  LYR - - — 

MMPIO  human  CSAYPLSGA - AKEED—  — SNKDL - AQQY  LEKY - 

MMPI1  human  ALLPP - MLLLLLQPPPL  LARAL -  - 

MMPI2  human  QATASGALP - LNSSTSL  EKNNVLF - GERY  LEKF - 

MMP13  human  CRALPLP-S - GGDEDDL  SEEDLQF - AERY  LRSY - 

MTI-MMP  human  CLLLPLLTLGTALASLGS  —  -AQSSSF - SPEA  WLQQ - 

MT2-MMP  human  PRLLPLLLV — LLGCLGLGV  AAEDAEV - HAEN  WLRL - 

MT3-MMP  human  VHHSGVFFLQTLLWILCATV  CGTEQYF - NVEV  WLQK - 

MT4-MMP  human - - - - - - E  WLSR _ - _ 

MMP19  human  -MNCQQLWLGFLLPMTVSGR  VLGLAEV - APVD  YLSQ - 

Collagenasc-4  frog  TAAFP-ADK - QDEPPA-  — TKEEM - AENY  LKRF - 

MMP  newt  AYAVQ-EAP - VHEEDD-  — rTIRQD - VEEY  LKKY - 

CMMP  chicken  SNSLPAQPE - KDNKE — - DTKL - VEDY  LSKF - 

Enamelysin  pig  KFSAAAPSL - FAATPRT  SRNNYHL - AQAY  LDKY - 

Envelysin  QTTTS S SQTVI SGWVEEGV  HESNVEI - LKAH  LEK - 

MMP  nematode -  - - - - 


-YNLKNDGRQVEKRR- 

-YGCPKE - S- 

-YDLKKDVKQFVRRK- 

-YLYDSE - T- 

-YQLPSNQYQSTRKN- 

-YGYTRVAE - MR- 

-YNLEKDVKQF-RRK- 

- PPDVHH — LH- 

-YGLEINKLPVTKMK- 
-YHPT-NLAGILKEN- 
-YGYLPPGDLRTHTQ- 
-YGYLPQPSRHMSTM- 
•YGYLPPTSPRMSW- 
-FGYLPPADPTTGQL- 
■YGYLQKPLE-GSNN- 
■YSLGTDGGPVGRKK- 
■YGLNSDKTPDLRKA- 
‘  YTI ETDS  NQRGWKA- 
•YTKKGGHQVGEMVA- 
■FGYTP — PGSTFGE- 


XMMP  frog  ISPRLCHSEKLFHSRDRSDL  QPSAIEQAELVKDMLSAQQF  LAKYGWTQPVXWDPSSTKEN  EPLKDFSLMQEGVCNPRQEV 
MMP  cress - AFSNFTGCHHGQ  KVDGLYR - IKKY  FQR -  - FGYIPETFSGNFTD- 


MMPI  human 
MMP2  human 
MMP3  human 
MMP7  human 
MMP8  human 
MMP9  human 
MMPIO  human 
MMP1 1  human 
MMP12  human 
MMP13  human 
MTI-MMP  human 
MT2-MMP  human 
MT3-MMP  human 
MT4-MMP  human 
MMP19  human 
ColIagcnasc-4  frog 
MMP  newt 
CMMP  chicken 
Enamelysin  pig 
Envelysin 
MMP  nematode 
XMMP  frog 
MMP  cress 


- - NSGPWEKLKQMQEFF 

- CNLFVLKDTLKKMQKFF 

- DSGPWKKIREMQKFL 

- KNANSLEAKLKEMQKFF 

- GTNVIVEKLKEMQRFF 

- G  ES  KS  LG  P ALLLLQKQL 

- DSNLIVKKIQGMQKFL 

- AERRGPQPWHAALPSSP 

- YSGNLMKEKIQEMQHFL 

- AAS  SMTERLREMQS FF 

- RSPQSLSAAIAAMQKFY 

- RSAQILASALAEMQRFY 

- RSAETMQSALAAMQQFY 

- QTQEELSKAITAMQQFG 

- FKPEDITEALRAFQEAS 

- HIQPFTEKLEQMQKFF 

- A-S  PLAEKIREMQKFC 

- NAEFTAEKLQKMQRFF 

- KGGNSMVKKIKELQAFF 

- A-NLNYTSAILDFQEHG 


_ Propeptide  sequence  continued 

GLKVTGKPDAETLKVMKQPR  CGVPDV - 

GLPQTGDLDQNTIETMRKPR  CGNPDV - 

GLEVTGKLDSDTLEVMRKPR  CGVPDV - 

GLPITGMLNSRVIEIMQKPR  CGVPDV - 

GLNVTGKPNEETLDMMKKPR  CGVPDS - 

SLPETGELDSATLKAMRTPR  CGVPDL - 

GLEVTGKLDTDTLEVMRKPR  CGVPDV - 

APAPATQEAPRPASSLRPPR  CGVPD - 

GLKVTGQLDTSTLEMMHAPR  CGVPDV - 

GLEVTGKLDDNTLDVMKKPR  CGVPDV - 

GLQVTGKADADTMKAMRRPR  CGVPDK - 

GIPVTGVLDEETKEWMKRPR  CGVPDQ - 

GINMTGKVDRNTIDWMKKPR  CGVPDQ - 

GLEATGILDEATLALMKTPR  CSLPD - 

ELPVSGQLDDATRARMRQPR  CGLEDP - 

GLKVTGTLDPKTVEVMEKPR  CGVYDV - 

GLQVTGKVDSNTLEVMQQPR  CGVSDV - 

GLKVTGKPDTETLEMMKKPR  CGVPDV - 

GLRVTGKLDRTTMDVIKRPR  CGVPDV - 

GINQTGILDADTAELLSTPR  CGVPDV - 


AEPTKSPQFIDALKKFQKLN  NLPVTGTLDDATINAMNKPR 
- DFDDILKAAVELYQTNF  NLNVTGELDALTIQHIVIPR 


- MRSILLFILI 

CGVPDNQMAKKETEKPTAAQ  SLENKTKDSENVTQQNPDPP 
CGNPDWNGTSLMHGGRRKT  F - - - 


( continued  on  next  page) 
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MMPlhmnan  - AQFVLTEGNPRWE 

MMD?LlUmaU  -  rANYNFFPRKPKWD 

MMP7  !UJinan  - GHFRTFPGIPKWR 

WMpI^man  - AEYSLFPNSPKWT 

MMDotlUman  -  GGFMLTPGNPKWE 

MMP9  human  -  GRFQTFEGDLKWH 

I!0'00"  -  GHFSSFPGMPKWR 

^U™n  - PS  DGLSARNRQKRFVLSGGRWE 

™  -an  - - HHFREMPGGPVWR 

f^Uman  -  GEYNVFPRTLKWS 

»mwu!JUman  - 7-FGAE  IKANV — RRKRYAIQGLKWQ 

fl?™™ - FGVR  VKANLRRRRKRYALTGRKWN 

uSwwnL  ^ - TRGS  SKFHI— RRKRYALTGQKWQ 

Mwwn,nrUman - - - LPVLTQARRRRQAPAPTKWN 

MMP19  human - - FN — - QKTLKYLLLGRWR 

Collagenasc-4  frog  - GQYSTVAKSSAWQ 

MMPncwt  - AAYSTFPGRPAWR 

CMMP  chicken  - - GLYGV — TLPGWK 

Enamclysin  pig  - ANYRLFPGEPKWK 

- - LPFVTSSITWS 

MMP  nematode  FLIFAISEAQENIDKNLDFI  KPIGFGSREKRYVIRAKRWK 
XMMP  frog  KIRRKRFLDMLMYSNKYREE  QEALQKSTGKVFTKKLLKWR 
MMP  cress  - EVNFS  RTHLHAVKRYTL FPGE PRW P 


Catalytic  domain _ 

QT - HLTYRIENYTPD — L 

KN - QITYRIIGYTPD — L 

KT - HLTYRIVNYTPD— L 

SK - VVTYRIVSYTRD — L 

RT - NLTYRIRNYTPQ--L 

HH NITYWIQNYSED — L 

KT HLTYRIVNYTPD — L 

KT DLTYRILRFPWQ — L 

KH YITYRINNYTPD — M 

KM NLTYRIVNYTPD — M 

HN EITFCIQNYTPK — V 

NH HLTFSIQNYTEK- — L 

HK HITYSIKNVTPK — V 

KR - NLSWRVRTFPRDS  PL 

KK HLTFRILNLPST — L 

KK - DLTYRILNFTPD — L 

TH ALTYRILNYTPD — M 

KN - NLTYRIVNYTPD— L 

KN - TLTYRISKYTPS — M 

RNQ — PVTYSFGALTSD — L 

KHTLTWQLQTQNLLDPD - 

M - IGEGYSNQ — L 

RNRR-DLTYAFDPKNPLT — 


Catalytic  domain  continued  (second  Zn2+-  and  Ca2+-binding  subdomain) 

uUman  PLTFrKV - SEGQA  DIMISFVRGDHRDNSP-FDG  PGGNLAHAFQPGPGIGGD-A 

^  PLRFSRI - HDGEA  DIMINFGRWEHGDGYP-FDG  KDGLLAHAFAPGTGVGGD-S 

w^uman  PLTFSRL - YEGEA  DIMISFAVREHGDFYP-FDG  PGNVLAHAYAPGPGINGD-A 

uuDouUman  PLHFRKV - VWGTA  DIMIGFARGAHGDSYP-FDG  PGNTLAHAFAPGTGLGGD-A 

f^Uman  PLTrTRI - SQGEA  DINIAFYQRDHGDNSP-FDG  PNGILAHAFQPGQGIGGD-A 

^IoinuUman  PLTFTRV - YSRDA  DIVIQFGVAEHGDGYP-FDG  KDGLLAHAFPPGPGIQGD-A 

J1wDi?l!Uman  PLTFSRL - YEGEA  DIMISFAVKEHGDFYS-FDG  PGHSLAHAYPPGPGLYGD-I 

/Uman  PLTFTEV - HEGRA  DIMIDFARYWDGDDLP-FDG  PGGILAHAFFPKTHREGD-V 

PLKFSKI - NTGMA  DILWFARGAHGDFHA-FDG  KGGILAHAFGPGSGIGGD-A 

irnm  nu  PLNFTRL - HDGIA  DIMISFGIKEHGDFYP-FDG  PSGLLAHAFPPGPNYGGD-A 

^  wwn?uman  PLRFREVPYAYIREGHEKQA  DIMIFFAEGFHGDSTP-FDG  EGGFLAHAYFPGPNIGGD-T 
^E'MMPhliman  PLVFQEVPYEDIRLRRQKEA  DIMVLFASGFHGDSSP-FDG  TGGFLAHAYFPGPGLGGD-T 
xrS*w!/n  LUman  PLTFEEVPYSELENGK-RDV  DIPIIFASGFHGDSSP-F[)G  EGGFLAHAYFPGPGIGGD-T 

t!U 013X1  PLNFHEVA - GSTA  DIQIDFSKADHNDGYP-FDA  -RRHRAHAFFPGHHHTAGYT 

MMP19  human  PLTFQEV - QAGAA  DIRLSFHGRQSSYCSNTFDG  PGRVLAHADI PELG - SV 

CoIIagenase-4  frog  PLTFTRI - YNEVS  DIEISFTAGDHKDNSP-FDG  SGGILAHAFQPGNGIGGD-A 

PM^DMu"fWt  PLTFTQI - YYGTA  DIQISFGAREHGDFNP-FDG  PYGTLAHAFAPGTGIGGD-A 

CMMP  chicken  PLIFARI - QEGIA  DIMVA FGTKAHGHC PRYFDG  PLGVLAHAFPPGSGFGGD-V 

Enamelysin  pig  PLSFVRV - NAGEA  DIMISFETGDHGDSYP-FDG  PRGTLAHAFAPGEGLGGD-T 

>.  .. /fntnVC  yS1,n  GLSFREV - P - DTTSV  DIRIKFGSYDHGDGIS-FDG  RGGVLAHAFLPRNG - DA 

M  6  SVDFREIPPDLV - TKQPP  DIYIAFEKGEHSDGFP-FDG  QDGWAHAFYPRDG - RL 

XMMP  frog  PLDFEEDNTSPL - SQI  DIKLGFGRGRHLGCSRAFDG  SGQEFAHAWFLGD - 1 

MMP  cress  ALNFTLSE - SFSTS  DITIGFYTGDHGDGEP-FDG  VLGTLAHAFSPPSG - KF 

@  6  %B  B  B  e 

YYYYYYYYYY  JJJJJJJ  XXXXXXXX 

Fibronectin  type-II  -  like  domain _ 


GNADGEYCKFPFLFNGKEYN  SCTDTGRSDGFLWCSTTYNF  EKDGKYGFCPHEALFTMGGN 


GNADGAACHFPFIFEGRSYS  ACTTDGRSDGLPWCSTTANY  DTDDRFGFCPSERLYTRDGN 


MMPI  human 
MMP2  human 
MMP3  human 
MMP7  human 
MMP8  human 
MMP9  human 
MMP10  human 
MMPI  I  human 
MMP12  human 
MMPI 3  human 
MTl-MMP  human 
MT2-MMP  human 
MT3-MMP  human 
MT4-MMP  human 
MMP19  human 
Collagenase-4  frog 
MMP  newt 
CMMP  chicken 
Enamclysin  pig 
Envelysin 
MMP  nematode 
XMMP  frog 
MMP  cress 


PRADVDHAIEKAFQLWSNVT 
DPETVDDAFARAFQVWSDVT 
.  PKDAVDSAVEKALKVWEEVT 
PHITVDRLVSKALNMWGKEI 
SEAEVERAIKDAFELWSVAS 
PRAVIDDAFARAFALWSAVT 
PRDAVDSAIEKALKVWEEVT 
VQEQVRQTMAEALKVWS  DVT 
NREDVDYAIRKAFQVWSNVT 
THS E VEKAFKRAFKVWS  DVT 
GEYATYEAIRKAFRVWESAT 
GWYHSMEAVRRAFRVWEQAT 
G DPETRKAI RRAFDVWQNVT 
GHDTVRALMYYALKVWSDIA 
P PHTARAALRQAFQDWSN VA 
PQADVETAIQRAFKVWS  DVT 
ARADVDTAIQKA  FKVW  S  DVT 
SKEWDKAIQKAFKARSTVT 
T  PAE VDKAMEMALQAW  S  S A V 
NQNDVKDEIRRAFRVWDDVS 
-VFIVRNTMHRAFNEWSTVS 
SINEQRYVFRLAFRMWSEVM 
— EEVKSVFSRAFGRWSDVT 


HFDEDERWTNN - 

HFDDDELWTLGEGQWRVKY 

HFDDDEQWTKD - 

HFDEDERWTDG - 

HFDAEETWTNT - 

HFDDDELWSLGKGVWPTRF 

HFDDDEKWTED - 

HFD YDETWT IGDDO - 

HFDEDEFWTTH - 

HFDDDETWTSS - ■— 

HFDSAEPWTVRNEDL - 

HFDADEPWTFSSTDL - 

HFDSDEPWTLGNPNH - 

HFNDDEAWTFRSSDA - 

HFDEDEFWTEGTY - 

H  FDE  DETWTKT - 

HFDEDEKWSKV - 

HFDEDEDWTMG - 

HFDNAEKWTMG - 

HFDDSETWTEGTR - 

HFDAEEQWSLNSV - 

HFDDDEHFTAPS - 

HLDADENWWSGDL - 

@  %  % 

$$$$ 


AEGQPCKFPFRFQGTS  YDSC 


ADGKPCQFP  FI FQGQS  YS AC 
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MMP1  human 
MMP2  human 
MMP3  human 
MMP7  human 
MMPS  human 


_ _ Fibronectin  type-II  -  like  domain  continued _ 

TTEGRTDGYRWCGTTEDYDR  DKKYGFCPETAMSTV-GGNS  EGAPCVFPFTFLGNKYESCT 


.  |,uman  TTDGRSDGYRWCATTANYDR  DKLFGFCPTRADSTVMGGNS 

MMP10  human  -  - 

MMPI1  human  -  _ - _ - 

MMP12  human  _ _ _  _ 

MMPI3  human  -  - 

MTJ-MMP  human -  - 

MT2-MMP  human  -  - 

MT3-MMP  human  - - 

MT4-MMP  human _ _ _ _ _ _ 

MMPI9  human  - - 

Collagenasc-4  frog _ _  _ 

MMP  newt - - - - 

CMMP  chicken  -  - 

Enamelysin  pig - - 

Envclysin - - - - - - 

MMP  nematode - - 

XMMPfrog -  - II 

MMP  cress - - 


AGELCVFPFTFLGKEYSTCT 


SAGRSDGKMWCATTANYDDD 


SEGRGDGRLWCATTSNFDSD 


MMP1  human 
MMP2  human 
MMP3  human 
MMP7  human 
MMP8  human 
MMP9  human 
MMP  10  human 
MMP1 1  human 
MMP12  human 
MMP13  human 
MT1-MMP  human 
MT2-MMP  human 
MT3-MMP  human 
MT4-MMP  human 
MMP19  human 
Collagenase-4  frog 
MMP  newt 
CMMP  chicken 
Enamelysin  pig 
Envclysin 
MMP  nematode 
XMMP  frog 
MMP  cress 


Catalytic  domain  (catalytic  Zn2+  -binding  subdomain) 


MMP  I  human 
MMP2  human 
MMP3  human 
MMP7  human 
MMP8  human 
■  MMP9  human 
MMPIO  human 
MMP1 1  human 
MMP12  human 
MMP  13  human 
MTl-MMP  human 
MT2-MMP  human 
MT3-MMP  human 
MT4-MMP  human 
MMP19  human 
CoIIagenase-4  frog 
MMP  newt 
CMMP  chicken 
Enamelysin  pig 
Envelysin 
MMP  nematode 
XMMP  frog 
MMP  cress 

Figure  5.  ( continued) 


- FR-EYNLHRVAAHE  LGHSLGLSHSTDIGALMYPS  Y-T-F — SGDVQLAQDDIDG 

RKWGFCPDQGYSLFLVAAHE  FGHAMGLEHSQDPGALMAPI  YTY - TKNFRLSQDDIKG 

- tt-gtnlflvaahe  ighslglfhsantealmypl  YHS-LTDLTRFRLSQDDING 

- SSLGINFLYAATHE  LGHSLGMGHSSDPNAVMYPT  YG — NGDPQN FKLS QDDI KG 

- SA-NYNLFLVAAHE  FGHSLGLAHSSDPGALMYPN  Y-A-FRETSNYSLPQDDIDG 

KKWGFCPDQGYSLFLVAAHE  FGHALGLDHSS V PEALM YPM  YRF - TEGPPLHKDDVNG 

- as-gtnlflvaahe  lghslglfhsantealmypl  YNS-FTELAQFRLSQDDVNG 

- GTDLLQVAAHE  FGHVLGLQHTTAAKALMSAF  YT - FRYPLSLSPDDCRG 

- SG-GTNLFLTAVHE  IGHSLGLGHSSDPKAVMFPT  YK — YVDINTFRLSADDIRG 

- SK-GYNLFLVAAHE  fghslgldhskdpgalmfpx  y-t-ytgkshfhlpdddvqg 

- hgndiflvavhe  lghalglehssdpsaimapf  yq — WMDTENFVLPDDDRRG 

- HGNNLFLVAVHE  LGHALGLEHSSNPNAIMAPF  yq — WKDVDNFKLPEDDLRG 

DGNDLFLVAVHE  LGHALGLEHSNDPTAIMAPF  YQ — YME-QTLQLPNDDYR- 

- hgmdlfavavhe  FGHAIGLSHVAAAHSIMRPY  YQGPVGDPLRYGLPYEDKVR 

- RGVNLRIIAAHE  VGKALGLGHSRYSQALMAPV  YEG Y - RPH FKLHP DDVAG 

- SE-IYNLFLVAAHE  FGHSLGLSHSTDQGALMYPT  Y-S-NTDPKTFQLPQDDINA 

- ST-GTNLFLVAAHE  FGHSLGLSHSNDRNALMFPT  Y-S - YTDPARFRLPKDDING 

- SD-GFNLFLVAAHE  VGHALGLSHPNDQRAFMFPN  Y-A-YISPSEFPLSPDDISG 

- MN-GFNLFTVAAHE  FGHALGLAHSTDPSALMYPT  YK — YQNPYGFHLPKDDVKG 

SGTNLFQVAAHE  FGHSLGLYHSTVRSALKYPY  YQ - GYVPNFRLDNDDIAG 

- EGVNLFQTAVHE  IGHLLGLEHSMDVRAAMFAA  KR - PYDPAFTLGDDDVRA 

- -SEHGISLI.KVAAHE  IGHVLGLSHIHRVGSIMQPN  Y - 1 PQDSGFELDLSDRRA 

-DSFLSVTAAVDLESVAVHE  IGHLLGLGHSSVEESIMYPT  IT - TGKRKVDLTNDDVEG 

#  #  # 

$$$$$  JJ  JZZZZZZZZZZZ 

_ Hinge,  continued _ _ 


DLGTGPT - PTLGPVTP- 


-PIGPQ  T- 


PTTTTPQPTAPPTVCPTGPP 
- QPWPT 


- fptkmppqp 

VTPRRPGRPDHRPPRPPQPP 
VPPHRS I PP — ADPRKNDR- 
- SPTAQPEEP 


TTTRATT - 

EDPVTTVKPISKEEGIDEEN 


- TPLVP 

- K - 

- PTGPS 

TVHPSERPTAGPTGPPSAGP 
- EPLVP 

- vtsrtpalg 

- LPNPD 

- PKHPK 

- RTTSRPSVP 

- PPGGKPERP 

- PKPPRPPTG 

- PLLPEPPDN 

- RDEE 

- PTGPS 

- QTPPP 

- KRPTV 

- KPTVP 

- TRRTTTTRA 

NLAKVINI IQRIE - 


TEPVPPEPGT- 


T - p 

TGPPTAGPSTATTVPLSPVD 

TKSVPSGSEM - p 

-PQAGIDTNEIAPLEPDAPP 

NSE - p 

T - p 

DKPKNPT - YGP 

PKPGPPVQPRATERPDQYGP 

-RPSYPGAK - p 

RSSAPPRKD - VP 

EEETELPTVPPVPTEPSPMP 

- - - 

TKPAL - q 

PTS - p 

HGPPHNPS - LP 

TTTRATTTTTTSPSRPS  PPR 


TKHQRDTGGFSAAWRI DGSS  RSTIVSLLLSTVGLVLWFLP 


-CEGPFDTAFD 


r  ,  Hinge 

IQAIYGRSQNPVQ - 

IQELYGASP — DI - 

IQSLYGPPPDSPE - 

IQKLYGKRSNSRK - 

IQAIYGLSSNPIQ - 

IRHLYGPRPEPEPRP - 

IQSLYGPP PASTE - 

VQHLYG - 

IQSLYGDPKENQR - 

XQSLYGPGDEDPN - 

IQQLYGGESG - 

iqqlygtpdgqpqptqplpt 

-HQRYMS  P  DKI PP PTRPL PT 

VWQLYGVRESV - 

'iQALYGKKSPVI - 

IQYLYGKSSNPVQ - 

IQAIYGPSRKPSP - 

IQSIYGSATKTPG - 

IQALYGPRKTFTG - 

XRSLYGSNSGSGTTTTTRRP 

IRSIiFPINETDANSGSEENS 

IQNLYGS - 

IQYLYGANPNFNGTTSPPST 


- 

KACDSKXTFDAITTIRGEVM 

EICKQDIVFDGIAQIRGEIF 

ANCDPALSFDAVSTLRGEIL 


KPCDPSLTFDAXTTLRGEIL  - 
DACNVN-I FDAIAEIGNQLY 
AKCDPALS  FDAISTLRGE YL 
DACEAS — FDAVSTIRGELF 
ALCDPNLS  FDAVTTVGNKI F 
DKCDPSLSLDAITSLRGETM 
NICDGN — FDTVAMLRGEMF 
NICDGD — FDTVAHLRGEMF 
NICDGN — FNTLAILRREMF 
HRCSTH — FDAVAQIRGEAF 
DPCSSELD-AMMLGPRGKTY 
SRC  DPNWFNAVTTMRGEL I 
SYCDPAIRWDAITTLRNEIL 
NTCGPQX S  FDAVTTLRREVI 
DICDSSSSFDAVTMLGKELL 
RACS — GSFDAWRDSSNRI 


WIHKEKNQYGELV-VRYNTY 
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Hemopexin-like  domain 


l222£urnan  ffkdrfymr-tnpf-ype-v 

MMP2  human  FFKDRFXWRTVTPRDKPM— 

w^2?u,nan  IFKDRHFWR-KSLR-KLE-P 
MMP7  human  - 

MMPJJ  human  ffkdryfwr-rhpq-lqrZv 
Jluman  lfkdgkywrfsegrgsrp-q 
MMP  O human  FFKDRYFWR-RSHW-NPE-P 
ww^  i?Uman  FftKAGFVWRL-RGGQLQP-G 

wwn man  ffkdrffwl-kvse-rpk-t 

MMPI3  human  IFKDRFFWR-LHPQ-qvd-a 
MTI-MMP  human  VFKKRWFWRV-RNNQVMD-G 
MT2-MMP  human  VFKGRWFWRV-RHNRVLD-N 
MT3-MMP  human  VFKDQWFWRV-RNNRVMD-G 
MT4-MMP  human  FFKGKYFWRLTRDRHLVS-L 

MMP19  human  AFKGDYVWTVSDSGPGP - 

Collagcnase-4vfrog  FFVKRFLWR-KHPQ-ASE--A 
MMPnewt  FFNGRTFLR-SMPH-TGR-I 
CMMP  chicken  FLKGRHLWR-VYPD-NSE-V 
Enamelysin  pig  FFRDRIFWR-RQVHLMSG-I 
Envclysin  YALTGPYFWQLDQPSPSW-G 
MMP  nematode - 


XMMP  frog  FFRNSWYWMYENRSNRTRYG 
MMP  cress  _ 


ELNFISVFWPQLP - NGLE  AAYEFADRDEVRFFKGNKYW 

GPLLVATFWPELP - EKID  AVYEAPQEEKAVFFAGNEYW 

ELHLISSFWPSLP - SGVD  AAYEVTSKDLVFIFKGNQFW 

EMNFIS LFWPSLP - TGIQ  AAYEDFDRDLI FLFKGNQYW 

GPFLIADKWPALP - RKLD  SVFEEPLSKKLFFFSGRQVW 

EFHLISAFWPSLP - SYLD  AAYEVNSRDTVFIFKGNEFW 

YPALASRHWQGLP - SPVD  AAFED-AQGHIWFFQGAQYW 

SVNLISSLWPTLP - SGIE  AAYEI EARNQVFLFKDDKYW 

ELFLTKSFWPELP - NRID  AAYEHPSHDLIFI FRGRKFW 

Y PMP IGQFWRGLP - ASIN  TAYER-KDGKFVFFKGDKHW 

YPMPIGHFWRGLP - GDIS  AAYER-QDGRFVFFKGDRYW 

YPMQITYFWRGLP - PSID  AVYEN-SDGNFVFFKGNKYW 

QPAQMHRFWRGLPLHLDSVD  AVYERTSDHKIVFFKGDRYW 

-LFRVSALWEGLP - GNLD  AAVYSPRTQWIHFFKGDKVW 

ELMFVQAFWPSLP - TNID  AAYENPITEQILVFKGSKYT 

ISYTISAVWPSLP - SGIH  AAYENQQKDQVLL FKGNK YW 

ELELISAFWPFLP - SGIQ  AAYENM-KDRI LFFKGNNFW 

RPSTITSSFPQLM - SNVD  AAYEVADRGKAYFFKGPHYW 

LVSNRFGFGLPQNIDASFQR  GVVTYFFSECYYYYQTSTQR 

DPLAIANGWHGIPVQNIDAF  VHVWTWTRDASYFFKGTQYW 


- AVQGQNVLHGYP 

- iysaSTlergyp 

- airgnevragyp 

- ALSGYDILQGYP 

- VYTGASVLG — P 

- AIRGNEVQAGYP 

- VYDGEKPVLG-p 

- LISNLRPEPNYP 

- alngydilegyp 

- VFDEASLEPGYP 

- LFREANLEPGYP 

- VFKDTTLQPGYP 

- VFKDNNVEEGYP 

- ryinfkmspgfp 

- ALDGFDWQGYP 

- AMKGYQMLPNYP 

- VVSGYKVLLGYP 

- ITRGFQMQG — P 

- NFPRIPVNRKWV 

RYDSENDKAYAEDAQGKSYP 


MMP1  human 
MMP2  human 
MMP3  human 
MMP7  human 
MMP8  human 
MMP9  human 
MMP10  human 
MMP1 1  human 
MMP  12  human 
MMP13  human 
MT1-MMP  human 
MT2-MMP  human 
MT3-MMP  human 
MT4-MMP  human 
MMP  19  human 
Collagenase-4  frog 
MMP  newt 
CMMP  chicken 
Enamelysin  pig 
Envelysin 
MMP  nematode 
XMMP  frog 
MMP  cress 


Hemopexin-like  domain,  continued 


FGFPRTVKHI DAAL  SEEN  TGKTYFFVANKYWR  YDEYKRSMDPGYPKMIAHDF  PGIG _ HKVDRV  -pm' 

Brrr^^n^VQRVDRAr  NWSK__NKKTYIFAGDKFWR  YNEVKKKMDPGFPKLIADAW  NAIP - DNLDAWDLO 

RGIH-TLGFPPrVRKIDAAI  SDKE--KNKTYFFVEDKYWR  FDEKRNSMEPGFPKQIAEDF  PGID _ SKIDAV^FE 


KDI-SNYGFPSSVQAIDAAV 
RRLD-KLGLGADVAQVTGAL 
RGIH^TLGFPPTIRKIDAAV 
APLT-EL — GLVRFPVHAAL 
KS IH - S FGFPNFVKKI DAAV 
KKIS-ELGLPKEVKKISAAV 
KHIK-ELGRGLPTDKIDAAL 
QPLT-S YGLGI PYDRIDTAI 
HDLI-TLGSGIPPHGIDSAI 
RPVS-DF — SLPPGGIDAAF 
KKLN - RVEPNLDAAL 

rniy-slgfpktvkridaav 

QN I Y-TLGLPRT VTRIDAAV 
KN IN -TLGFPKGVKKI DAAV 
PRT I YD FGFPRYVQRI DAAV 
- GLPCNIDA 


FYR - SKTYFFVNDQFWR 

RSGR — G-KMLLFSGRRLWR 
SDKE — KKKTYFFAADKYWR 
VWGPEKN-KIYFFRGRDYWR 
FNPR — FYRTYFFVDNQYWR 
HFED — TGKTLLFSGNQVWR 
FWMP — NGKTYFFRGNKYYR 
WWEP — TGHTFFFQEDRYWR 
WWED — VGKTYFFKGDRYWR 
S WAH — NDRTYFFKDQLYWR 
YWP — LNQKVFLFKGSGYWQ 
HIEQ — LGKTY  FFAAKK  YW  S 
YHPD — TRKTYYFVNDKYWS 
CNKN — TGKT  DFFVGDKYWR 
HLKD — TQKTLFFVGDEYYS 
VYRS  S -RGPT YFFKDS FVYK 


YDNQRQFMEPGYPKSISGAF 

FDVKAQMVDPRSASEVDRHF 

FDENSQSMEQGFPRLIADDF 

FHPSTRRVDSPVPRRAT-DW 

YDERRQMMDPGYPKLITKNF 

YDDTNHIMDKDYPRLIEEDF 

FNEELRAVDSEYPKNIK-VW 

FNEETQRGDPGYPKPIS-VW 

YSEEHKTMDPGYPKPIT-VW 

YDDHTRHMDPGYPAQSP-LW 

WDELARTDFSSYPKPIKGLF 

YDEDKKQtfDKGFPKQISNDF 

FDEALQVHDKDSPQQIVTTF 

YDESTQSMEKGYPRRTVNDF 

YDERKRKMDKDYPKNTEEEF 

FNSNNRLQRRTRISSLFNDV 


PGIE - SKVDAV — FQ 

PGVP - LDTHDVFQYR 

PGVE - PKVDAV — LQ 

RGVP - SEIDAAF-QD 

OGIG - PKIDAVF-YS 

PGIG - DKVDAV — YE 

EGIP - ESPRGSFMGS 

QGip - ASPKGAFLSN 

.KG  IP - ESPQGAFVHK 

RGVP - STLDDAMRWS 

TGVP - NQPSAAMSWQ 

PGIP - DKIDAA — FY 

PRIG - TKVDAV — FY 

PGIS - QRIDAV — FQ 

SGVN - GQIDAA — VE 

PSAL - HDGVEAWRA 


F--DRRRQYIYFFRDSQVFA  FDINRNRVAPDFPKRILDFF  PAVAANNHPKGNI DVAYYS Y 


MMPI  human 
MMP2  human 
MMP3  human 
MMP7  human 
MMP8  human 
MMP9  human 
MMP  10  human 
MMP II  human 
MMP12  human 
MMP13  human 
MT1-MMP  human 
MT2-MMP  human 
MT3-MMP  human 
MT4-MMP  human 
MMP19  human 
Collagenase-4  frog 
MMP  newt 
CMMP  chicken 
Enamelysin  pig 
Envelysin 
MMP  nematode 
XMMP  frog 
MMP  cress 

Figure  5.  ( continued) 


Hemopexin-like  domain,  continued 


Linker 


KDGFFYFFHGTRQYK-FDPK  TKRILTLQKANS - - WFNCRKN _  _ _ 

GGGHSYFFKGAYYLK-LENQ  S - LKSVK-FGSIK  S-DWLGC - 

EFGFFYFFTGSSQLE-FDPN  AKKVTHTLKSNS - - WLNC _  _ 11111111111  I 


QEHFFHVFSGPRYYA-FDLI 

EKA--YFCQDRFYWR-VSSR 

AFGFFYFFSGSSQFE-FDPN 

ADGYAYFLRGRLYWK-FDPV 

KNKYYYFFQGSNQFE-YDFL 

KNGYIYFFNGPIQFE-YSIW 

DEVFTYFYKGNKYWK-FNNQ 

DAAYTYFYKGTKYWK-FDNE 

ENGFTYFYKEGVLEI-QTTR 

DGA-SYFFRGQEYWK-VLDG 

d-grvyffkgkvywr-ln-q 

YRGRLYFFIGRSQFE-YNIN 

AKGLLYFFNGQHQFE-FNMR 

HKGLFYFFHGSRQLK-FDPT 

LNGYIYFFSGPKAYK-YDTE 

DRNYIHFYRDGRYYR-MTDY 


AQRVTRVARGNK - - WLNCRYG -  ZZZZZZZZZIIZZ 

S - ELNQVDQVGYVT  Y-DILQCPED - ' _  _  _  ‘ 

ARMVTHILKSNS - - WLHC _  _ 

KVKALEGFPRLVGP - — DFFGCAEPA NTFL —  _  _  II 

LQRITKTLKSNS - - WFGC -  - 

SNRIVRVMPANS - - ILWC _  ZZZZZZZZZIIZIZZ 

KLKVEPGYPKSALR - —  DWMGC - PSGGRPD-  - EGTEEETEV 

YSRLEPGHPRST^K - DFMGCQEHVEPGPRWPDV  ARPPFNPHGGAEPGADSAEG 

YSRLEPGHPRS I LK -  DLSGCDGPTDRVKEG - - HSPPDDVDT 

QLRVEKGYPRNTSH - DWLVCGDSQADGSVAAGV  DAAEGPRAPPGQHDQSRSE- 

S™::::::  "mtoituotlsatett 

LKKVTRVLKKSS - - WFSC _  _ 

AKRVISEIKSNS - - WFNC _  ZZZZ  ZZ  Z 

KEDVVSVLKSNS - - WIGC _  Z_ZZZZZZZZZZ_Z _ Z 

GRQFVN  FPNG  LPYSDVIES-  — VIPQCRGRSLSYESEGCS  NSSE _ ZZ _ ZZZ 


tysslflfkgkefwkvvsdk  drrqnpslpynglfprrais  qqwfdicnvhpsllki _  ZZZZZZZZZZZZZZZZZZZZ 
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MMP1  human 
MMP2  human 
MMP3  human 
MMP7  human 
MMP8  human 
MMP9  human 
MMPIO  human 
MMPI 1  human 
MMP12  human 
MMPI 3  human 
MT1-MMP  human 
MT2-MMP  human 
MT3-MMP  human 
MT4-MMP  human 
MMP19  human 
CoMagenase-4  frog 
MMP  newt 
CMMP  chicken 
Enamelysin  pig 
Envelysin 
MMP  nematode 
XMMP  frog 
MMP  cress 


Transmembrane  linker  ^  Cytoplasmic  region 


IIIEVD - - - - EEGGGAVSAAAWLP  VLLLLLVLAVGLAVFFFRRH  GTPRRLLYCQRSLLDKV 

DVGDGDGDFGAGVNKDGGSR  VWQMEEVARTVNWMVLVP  LLLLLCVLGLTYALVQMQRK  GAPRVLLYCKRSLQEWV 

VI - * - - KLDNTASTVKAIAI VIP  CILALCLLVLVYTVFQFKRK  GTPRHILYCKRSMQEWV 

- - DGYE  VCSCTSGASSPPGAPGPLVA  ATMLLLLPPLSPGALWTAAQ  ALTL - - — - 


FEY- 


Figure  5.  ( continued ) 


that  the  catalytic  domains  of  MMP-2  and  MMP-9  (ge- 
latinases)  are  not  clustered  together.  Also,  unlike  se¬ 
quence  analysis  for  the  entire  enzymes,  sequences  for 
the  catalytic  domains  did  not  sequester  according  to 
the  location  of  the  respective  genes  within  the  given 
chromosome.  These  observations  collectively  argue 
for  the  fact  that  the  catalytic  domains  of  these  MMPs 
likely  evolved  in  parallel,  indicating  that  the  selection 
pressure  for  the  catalytic  domain  was  distinct  in  the 
course  of  the  diversification  of  this  family  of  enzymes. 

Three-dimensional  structures  of  the  catalytic 
domains  of  the  23  representative  MMPs 

The  unique  features  of  3-dimensional  structures  of 
these  enzymes  were  studied  by  developing  computa¬ 
tional  models  for  the  23  catalytic  domains.  The  crystal 
structures  of  the  catalytic  domains  of  four  MMPs 
(MMP-1,  MMP-3,  MMP-7,  and  MMP-8)  have  been  elu¬ 
cidated  (26-29).  We  modeled  the  catalytic  domain 
structures  of  the  remaining  19  representative  en¬ 
zymes  on  the  basis  of  the  similar  fold  of  the  proteins 
that  have  been  crystallized  (30,  31;  coordinates  for 
the  structures  of  the  modeled  MMP  catalytic  domains 
can  be  obtained  from  our  group  web  page:  http:// 
sun2.sdence.wayne.edu/ ~ somgroup).  Closer  exam¬ 
ination  of  the  structures  of  the  catalytic  domains  re¬ 
vealed  that  a  conserved  aspartic  acid  is  found  in  the 
vicinity  of  the  methionine  turn,  the  side  chain  of 
which  is  buried  inside  the  core  of  the  domain.  The 
only  variation  to  this  pattern  is  seen  in  the  rabbit 
MT1-MMP,  which  has  a  glutamic  acid  in  this  position. 
The  two  side  chain  oxygens  of  the  aspartic  acid  form 
two  critical  structural  hydrogen  bonds  to  the  back¬ 
bone  amides,  one  with  the  methionine  of  the  methi¬ 
onine  turn  and  another  with  the  residue  preceding 
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the  methionine.  The  conservation  of  this  pattern/ 
motif  in  all  64  known  MMPs  argues  that  these  three 
histidines  are  absolutely  required  structural  elements 
for  the  precise  positioning  of  the  catalytic  zinc  ion 
for  effective  catalysis.  These  three  amino  acids  are 
marked  with  the  pound  sign  (#)  in  Fig.  5. 

We  investigated  the  nature  of  two  additional  zinc 
and  calcium  binding  sites  formed  by  fkstrands  and 
turns  in  the  proximity  of  the  catalytic  zinc  in  all  the 
MMPs  (31).  The  64  MMPs  developed  at  least  four 
different  ways  to  bind  to  this  structural  zinc  ion.  For 
the  majority  of  MMPs  (60  enzymes  representing  20 
subfamilies  of  the  total  of  23  identified  by  us),  this 
site  is  provided  by  the  side  chains  of  an  aspartic  acid 
and  three  histidines  (marked  as  an  in  Fig.  5).  The 
signature  for  the  binding  site  of  the  structural  zinc 
and  the  calcium  ion  in  the  60  MMPs  is  JTs-fGNJ-D- 
X  ( 2 )  -  [PAS]  -F-D-  [  GA]  -X  (4)  -  [LIRV]  -  [AG]  -H5-  [AV]  - 
[FYS]  -P-X (5,7,9) -ffe-[FL] -D-X(2)-E-X-W.  The  letters 
in  bold  italics  represent  residues  that  provide  side 
chains  for  coordination  to  the  structural  zinc  and  cal¬ 
cium;  X(5,  7,  9)  indicates  five,  seven,  or  nine  vari¬ 
able  residues  in  between  the  flanking  sites.  The 
.  .  .FYS.  .  .  region  in  this  pattern  is  close  to  the  co¬ 
ordinated  histidines,  a  result  of  the  enzyme  fold.  The 
presence  of  these  hydrophobic  residues  creates  an 
increased  hydrophobic  environment  and  enhances 
the  binding  affinity  for  the  metal  ions.  The  human 
MMP-1 1  has  an  aspartic  acid  instead  of  one  of  His5, 
which  is  different  from  the  MMP-1 1  from  rabbit  and 
mouse,  which  still  have  a  histidine  at  this  position, 
suggesting  that  MMP-1 1  (comparing  those  from  hu¬ 
man,  rabbit,  and  mouse)  does  not  possess  a  unique 
motif  for  binding  of  the  structural  zinc.  The  chicken 
CMMP,  the  frog  XMMP,  and  the  human  MMP-1 9 
form  a  separate  subfamily  in  the  multiple-sequence 
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dendrograms  and  each  has  a  unique  motif  for  coor¬ 
dination  to  the  structural  zinc  ion  (31).  The  chicken 
CMMP  has  a  histidine  present  where  the  majority  of 
MMPs  have  an  aspartic  acid  coordinated  to  the  sec¬ 
ond  zinc  ion.  A  cysteine  residue  (Cys174)  adjacent  to 
the  fourth  coordinated  histidine  implicates  two  pos¬ 
sibilities  for  the  binding  mode  of  the  structural  zinc 
in  CMMP.  This  cysteine  may  either  provide  a  site  for 
protein  dimerization  or  another  binding  mode  for 
the  second  zinc  ion  in  CMMP  (three  original  histi¬ 
dines  and  the  cysteine  coordinated  to  the  zinc  ion) . 
This  cysteine  residue  is  also  present  in  the  sequence 
of  the  frog  XMMP  at  the  same  position.  The  struc¬ 
tural  zinc  coordination  in  MMP-19  is  seen  with  two 
histidines  at  usual  positions  and  one  cysteine  at  the 
corresponding  position  as  discussed  for  CMMP  and 
XMMP,  and  the  third  histidine  four  residues  toward 
the  amino  terminus.  It  has  a  noncoordination  serine 
at  the  position  where  most  MMPs  have  the  third  his¬ 
tidine  (Fig.  5).  The  position  of  the  third  histidine  is 
on  a  different  p-strand,  and  the  orientation  of  the 
side  chain  for  the  fourth  coordination  site  is  perfecdy 
acceptable.  Hence,  the  variations  noted  in  the  coor¬ 
dination  to  structural  zinc  ion  may  be  indicative  of 
the  different  outcomes  for  selection  of  novel  enzymic 
activities.  Furthermore,  the  dendrogram  shown  in 
Fig.  3  indicates  that  these  variations  in  the  zinc  bind¬ 
ing  motif  came  about  as  a  consequence  of  indepen¬ 
dent,  unrelated  evolutionary  processes. 

Calcium  ion  coordinates  with  six  elements  in  the 
3-dimensional  structures  of  catalytic  domains  of 
MMPs,  which  come  into  close  proximity  for  coordi¬ 
nation  as  an  octahedron.  In  all  64  MMPs,  three  of 
these  elements,  which  are  the  side  chains  of  amino 
acid  residues,  are  conserved.  These  are  two  aspartic 
acids  and  one  glutamic  acid  (marked  by  ‘%*  in  Fig. 
5).  A  minor  variation  is  seen  only  for  the  human 
MT4-MMP,  which  has  an  asparagine  residue  instead 
of  the  second  aspartic  acid.  The  other  three  calcium 
ligands  are  provided  by  the  backbone  carbonyl  oxy¬ 
gens  of  three  residues  within  a  turn.  Positions  are 
marked  by  the  letter  ‘B’  in  Fig.  5.  The  spatial  location 
of  the  catalytic  zinc  ion,  structural  zinc  ion,  and  the 
calcium  ion  are  shown  in  Fig.  6. 

Multiple-sequence  analysis  of  regions  encoding  the 
hemopexin-like  domain  of  MMPs 

Figure  4  shows  the  results  of  multiple-sequence  align¬ 
ment  for  the  hemopexin-like  domains  by  themselves. 
This  domain  is  absent  in  MMP-7  (matrilysin) ,  in  all 
known  plant  MMPs,  and  in  the  nematode  MMP.  The 
sequences  of  the  hemopexin-like  domains  of  inver¬ 
tebrate  MMPs  are  the  least  related  to  all  other  MMPs, 
which  indicates  their  ancient  origin.  In  some  MMPs, 
the  hemopexin-like  domains  have  been  shown  to  fa¬ 
cilitate  binding  and  denaturation  of  the  macromo- 
lecular  substrates;  it  would  be  interesting  to  correlate 


clustering  schemes  for  the  hemopexin-like  domains 
of  MMPs  to  their  substrate  specificities.  Because  of 
the  multiplicity  of  the  known  substrates  for  some  of 
these  enzymes  and,  in  contrast,  the  paucity  of  any 
information  on  substrates  for  other  MMPs,  it  is  ex¬ 
tremely  difficult  to  draw  any  substantial  conclusions 
on  this  issue.  On  the  other  hand,  one  cannot  help 
but  notice  the  diversification  of  the  hemopexin-like 
domains  seen  in  these  proteins;  insofar  as  this  do¬ 
main  is  clearly  linked  to  the  issue  of  substrate  speci¬ 
ficity,  the  diversity  in  this  domain  for  the  various 
MMPs  indicates  different  evolutionary  tangents  pur¬ 
sued  by  these  functionally  distinct  enzymes. 

Murphy  and  Knauper  (32)  recently  reviewed  the 
role  of  the  hemopexin-like  domains  in  relation  to  the 
substrate  specificities  and  activities  of  various  MMPs. 
It  was  suggested  that  the  hemopexin-like  domains 
mediate  binding  of  MMP-1,  MMP^8,  MMP-13,  and 
MMP-3  to  collagen  and  that  the  complex  participates 
in  the  cleavage  of  triple  helical  collagen.  In  the  case 
of  MMP-2  and  MMP-9,  the  hemopexin-like  domain  is 
important  for  interactions  with  TIMPs,  although  the 
high  degree  of  sequence  similarity  and  the  likely 
structures  of  the  hemopexin-like  domains  of  the  ge- 
latinases  suggest  there  is  a  high  degree  of  specificity 
in  the  binding  of  TIMPs  to  the  latent  forms  of  these 
enzymes.  For  example,  TIMP-1  binds  exclusively  to 
latent  MMP-9  (i^  -35  nM),  whereas  TIMP-2  binds 
to  latent  MMP-2  ( Kd  nM)  (43).  We  have  recently 
shown  a  biphasic  binding  of  TIMP-1  and  TIMP-2  to 
the  latent  forms  of  MMP-9  and  MMP-2,  respectively, 
with  the  hemopexin-like  domain  representing  the 
high-affinity  binding  site  (43).  Removal  of  the  he¬ 
mopexin-like  domain  of  MMP-2  decreases  the  affinity 
of  TIMP-2  for  the  active  site  without  significantly  af¬ 
fecting  enzymatic  activity.  TIMP-1,  which  efficiently 
inhibits  the  active  form  of  MMP-2,  does  not  bind  to 
a  carboxy-terminally  truncated  MMP-2  form,  dem¬ 
onstrating  the  importance  of  the  hemopexin-like  do¬ 
main  in  interactions  of  TIMP-1  with  active  MMP-2 
(43).  The  hemopexin-like  domain  of  MMP-2  has  also 
been  shown  to  play  a  role  in  zymogen  activation  by 
MT1-MMP  (44,  45).  It  has  also  been  suggested  that 
the  hemopexin-like  domain  of  MMP-2  plays  a  role  ih 
the  binding  of  the  enzyme  to  integrin  (46,  47), 
a  process  that  may  facilitate  localization  of  MMP-2  on 
the  cell  surface.  The  alignment  of  the  hemopexin- 
like  domains  shows  that  MMP-2  and  MMP-9  fall  into 
two  different  clusters.  The  hemopexin-like  domain  of 
MMP-9  diverges  at  a  higher  hierarchial  level,  impli¬ 
cating  that  it  is  somewhat  different  from  that  of  MMP- 
2  even  though  both  are  involved  in  TIMP  binding.  In 
contrast  to  gelatinases,  there  is  not  enough  biochem¬ 
ical  data  on  the  various  roles  of  hemopexin-like  do¬ 
mains  for  all  known  MMPs,  which  may  be  different 
in  each  case.  The  interesting  diversification  of  this 
domain  and  its  effect  on  the  functions  of  various 
MMPs  can  also  be  seen  in  the  dendrogram  of  Fig.  4. 


1090  Vol.  12  September  1998 


The  FASEB  Journal 


MASSOVA  ET  AL. 


30 


Figure  6.  Ribbon  drawing  of  the  modeled  3-dimensional  structures  for  the  catalytic  domains  of  MMP-12  (A)  and  MMP-19  ( B ). 
Red  spheres  represent  the  zinc  ions  and  the  green  sphere  represents  the  calcium  ion.  Catalytic  zinc  ion  is  located  at  the  center 
“  “*  catalytic  domain  and  structural  zinc  is  at  the  12  o’clock  position.  The  variable  loops  at  10,  5,  and  3  o’clock  (designated 
by  X  ,  L  ,  and  5  m  Fig.  5,  respectively)  are  shown  in  white.  The  two  models  are  similar  structurally,  except  for  the  variable 
regions  (shown  in  white) .  Figures  were  prepared  using  MOLSCRIPT  and  Raster3D  rendering  programs  (57  58) 


With  the  exception  of  envelysin  and  XMMP,  all  other 
hemopexin-like  domains  fall  under  one  cluster.  The 
hemopexin-like  domain  of  MMP-9  formed  a  cluster 
by  itself  (vide  infra) ,  whereas  those  of  MMP-1 1,  MMP- 
19,  and  the  MT-MMPs  formed  a  separate  cluster;  all 
remaining  MMPs  constitute  an  additional  cluster. 

The  hemopexin-like  domain  of  envelysin  forms  a  sep¬ 
arate  cluster  from  those  of  the  other  MMPs.  Envelysin 
degrades  the  protective  fertilization  envelope,  a  complex 
of  glycoproteins,  releasing  the  embryos  of  sea  urchin, 
although  the  individual  glycoproteins  have  not  yet  been 
identified.  Envelysin  is  also  known  to  hydrolyze  small 
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peptides  like  substance  P,  oxidized  insulin  B,  and  colla- 
genase  substrate-like  small  peptide  (48).  In  a  recent 
study,  it  was  also  shown  that  the  hemopexin-like  domain 
of  envelysin  determines  substrate  specificity  for  this  en¬ 
zyme  (24).  The  substrate  specificity  of  envelysin  is  be¬ 
lieved  to  be  similar  to  that  of  stromelysin-1,  which  also 
degrades  the  fertilization  envelope  proteins  of  grater 
than  100  kDa  From  the  position  of  envelysin  in  the  den¬ 
drogram  as  well  as  from  the  available  functional  data  on 
this  enzyme,  it  would  appear  that  the  hemopexin-like 
domain  of  this  MMP  is  distinct  from  the  rest  and  di¬ 
verged  early  from  those  of  the  other  MMPs. 
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Enamelysin  is  detected  during  the  development  of 
the  enamel  matrix  and  is  expressed  specifically  in  the 
enamel  tissue.  According  to  this  report  as  well  as  one 
by  other  investigators  (49),  enamelysin  forms  a  sep¬ 
arate  subfamily  of  MMPs.  It  is  difficult  to  correlate  its 
substrate  specificity  to  the  hemopexin-like  domain 
alignment  from  the  limited  biochemical  data  avail¬ 
able.  However,  its  position  suggests  that  it  may  be 
quite  different  from  other  collagenases.  Recently, 
MMP-20  has  been  sequenced  and  its  gene  has  been 
mapped  to  chromosome  11  (21).  Our  analysis  of  the 
complete  sequences  of  MMPs  (without  human  ena¬ 
melysin)  showed  that  pig  enamelysin  and  other 
MMPs  that  are  mapped  together  to  chromosome  11 
cluster  together  (Fig.  2 B).  In  GenBank,  another  hu¬ 
man  metalloproteinase  that  is  a  product  of  the  gene 
mmp20  has  recently  been  reported  (accession  num¬ 
ber  AJ003144);  this  enzyme  is  mapped  to  chromo¬ 
some  16.  Only  183  amino  acids  are  reported  in  its 
sequence;  it  probably  is  not  sequenced  completely 
and  is  not  included  in  our  current  analysis  (22). 

The  hemopexin-like  domain  of  MMP-12  shows 
clustering  by  itself  into  a  separate  subfamily  in  close 
relation  to  enamelysin.  Thus,  the  hemopexin-like  do¬ 
main  of  MMP-12  appears  to  have  diverged  into  a  sep¬ 
arate  subfamily  (i.e.,  specialized)  earlier  than  the 
hemopexin-like  domains  of  the  stromelysins  and 
collagenases  (MMP-1,  MMP-3,  MMP-8,  MMP-10,  and 
MMP-13,  the  exception  being  MMP-1 1),  underscor¬ 
ing  the  role  of  MMP-12  as  an  ‘elastase’.  Despite  the 
lack  of  a  hemopexin-like  domain,  MMP-7  possesses  a 
substrate  preference  similar  to  that  of  MMP-12,  sug¬ 
gesting  that  the  influence  of  the  hemopexin-like  do¬ 
main  on  substrate  interactions  is  limited,  and  other 
sites  may  play  a  role  in  determining  substrate  speci¬ 
ficity.  Indeed,  comparison  of  the  entire  sequences  of 
MMP-7  and  MMP-12  (Fig.  2)  shows  that  the  clusters 
of  these  enzymes  are  equally  remote  from  other 
MMPs,  which,  to  put  it  another  way,  are  equally  dis¬ 
tinct  from  other  collagenases,  gelatinases,  stromely¬ 
sins,  and  MT-MMPs. 

MMP-13  possesses  a  substrate  specificity  that  is 
broader  than  that  of  the  other  collagenases  such  as 
MMP-1  and  MMP-8.  The  hemopexin-like  domain  of 
MMP-13  distinguishes  itself  by  clustering  into  a  sep¬ 
arate  subfamily,  diverged  at  a  higher  level  of  hierar¬ 
chy  than  MMP-1  and  MMP-8.  The  frog  collagenase-4 
cleaves  collagen  type  I,  similar  to  MMP-1,  and  also 
possesses  a  weak  gelatinolytic  activity.  This  enzyme 
has  been  classified  as  a  different  type  of  collagenase 
due  to  its  characteristic  cleavage  pattern  of  gelatin 
(50).  This  is  supported  by  analysis  of  the  sequences 
of  hemopexin-like  domains  presented  here,  where 
the  hemopexin-like  domain  of  collagenase-4  clusters 
differently  than  those  of  other  collagenases. 

The  newt  MMPs  have  not  been  studied  extensively 
to  define  their  substrate  specificities.  However,  in  a 
sequence  comparison  study,  these  were  classified  as 


stromelysin  type'  (51).  In  our  analysis;  the  newt 
MMPs  also  fell  into  a  subfamily  comprised  of  collag¬ 
enases  and  stromelysins  (MMP-13,  MMP-1,  MMP-10, 
CMMP,  and  MMP-8).  MMP-1,  MMP-10,  MMP-3,' 
MMP-8,  and  CMMP  form  closely  akin,  independent 
clusters,  indicating  that  they  are  related,  yet  possess 
differentiated  substrate  specificities. 

The  hemopexin-like  domain  of  MMP-1 1  (strome- 
lysin-3),  along  with  that  of  MMP-19  and  MT-MMPs, 
diverged  and  formed  a  separate  subfamily.  MMP-1 1 
is  the  first  MMP  reported  to  be  activated  intracellu- 
larly  by  means  of  a  furin-like  convertase  and  has  been 
shown  to  be  unable  to  cleave  any  of  the  major  extra¬ 
cellular  matrix  components  like  other  collagenases 
and  stromelysins  (52).  The  hemopexin-like  domain 
of  MMP-1 1  clustered  separately  in  our  analysis,  sug¬ 
gesting  distinct  functional  properties.  Analysis  of  the 
role  of  the  hemopexin-like  domains  and  their  rela¬ 
tion  to  substrate  specificities,  if  any,  for  the  MT-MMP 
subfamily  is  more  complex  due  to  the  presence  of 
the  transmembrane  domain  (vide  infra) .  The  he¬ 
mopexin-like  domains  of  all  four  known  MT-MMPs 
form  a  subfamily  of  their  own.  Though  there  is  not 
enough  biochemical  data  on  MT-MMPs,  from  the 
alignment  of  the  hemopexin-like  domains  one  can 
say  that  these  domains  have  diverged  to  a  significant 
extent  from  those  of  the  other  MMPs  (except  MMP- 
H),  suggesting  that  their  functional  roles  are  differ¬ 
ent  from  the  rest  XMMP  has  not  been  studied  ex¬ 
tensively  for  natural  substrates.  However,  the  results 
of  the  alignment  of  the  complete  sequence  and  the 
hemopexin-like  domain  of  XMMP  suggest  that  the 
substrate  profile  of  XMMP  may  also  be  different  from 
those  of  the  other  MMPs. 

General  folding  of  the  catalytic  domains  of  MMPs 

In  a  previous  study  of  the  structural  aspects  of  MMP-2 
and  MMP-9  (30),  we  investigated  the  binding  modes 
of  peptide  substrates  in  the  active  sites  of  six  MMPs 
(four  crystallized  and  two  modeled).  Residues 
marked  by  the  letter  ‘J*  in  Fig.  5  provide  the  anchor¬ 
ing  interaction  to  the  backbone  elements  of  a  poten¬ 
tial  substrate.  The  general  structural  comparison  of 
the  23  representative  MMPs  revealed  four  areas  of 
topological  variability  in  the  catalytic  domains  of  the 
64  MMPs.  These  areas  are  formed  by  four  loops, 
three  of  which  are  located  in  the  vicinity  of  the  sub¬ 
strate  binding  region.  These  regions  are  marked  by 
the  letters  ‘X’,  T,  *Z’,  and  (Fig.  5).  Figure  6  shows 
the  ribbon  representation  for  two  typical  folds  found 
for  MMPs:  one  is  for  the  human  MMP-1 2  and  another 
is  MMP-19  (Fig.  6A,  B,  respectively).  The  three  vari¬ 
able  loops  (at  10,  5,  and  3  o’clock  positions  desig¬ 
nated  (in  Fig.  5)  as  X,  Z,  and  $,  respectively)  that 
could  have  contact  with  the  bound  substrates  (vide 
infra)  are  shown  in  white  in  Fig.  6.  The  Y  loop  is 
located  far  from  the  substrate  binding  area  on  the 
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catalytic  domain  and  is  not  especially  highlighted  in 
Fig.  6  (at  10  o’clock).  The  region  designated  by  the 
letter  X  (at  10  o’clock  in  Fig.  6)  is  formed  by  the  turn 
between  the  two  antiparallel  p-strands.  These  (T 
strands  provide  some  of  the  binding  ligands  for  the 
structural  zinc  and  calcium  binding  sites.  Our  previ¬ 
ous  models  for  substrate  binding  in  the  active  site  of 
MMPs  revealed  that  substrate  can  acquire  an  ex¬ 
tended  conformation  (30) .  In  such  a  binding  mode, 
the  unprimed  portions  [for  convention  on  identifi¬ 
cation  of  substrates  in  protein— substrate  complexes 
and  their  binding  sites  in  proteases,  consult  Berger 
and  Schechter  (53)]  of  substrates  would  have  con¬ 
tacts  to  the  Xloop.  This  loop  is  shorter  in  the  human 
MMP-19  (Fig.  6 B),  frog  XMMP,  envelysins,  plant 
MMPs,  and  nematode  MMP  than  in  all  other  MMPs. 
The  shorter  X  loop  makes  the  unprimed  areas  of  the 
active  site  more  open  in  these  enzymes.  The  $  loop 
has  contacts  with  the  primed  portion  of  the  substrate 
(P3'  position)  and  is  located  at  3  o’clock  in  Fig.  6. 
The  Z  region  (at  5  o’clock  in  Fig.  6)  is  an  ‘TP’  loop, 
which  forms  the  S/  binding  pocket;  in  the  crystal¬ 
lized  full-length  porcine  MMP-1,  this  loop  has  con¬ 
tacts  through  side  chains  and  bridging  water  mole¬ 
cules  to  the  hemopexin-like  domain  of  the  enzyme 
(54,  55).  The  length  of  this  loop  will  control  the  size 
of  the  residue  at  P/  position  of  the  substrate.  The 
composition  of  the  loop  will  have  an  effect  on  sub¬ 
strate  specificity  (30).  Furthermore,  binding  of  pro¬ 
tein  substrates  by  some  MMPs  is  influenced  by  inter¬ 
actions  with  the  hemopexin-like  domain  (32). 
Nonetheless,  the  specificity  of  small  synthetic  sub¬ 
strates  is  triggered  by  their  interactions  solely  with  the 
active  site  in  the  catalytic  domain  and  its  surround¬ 
ings.  This  is  probably  true  even  for  gelatinases,  since 
the  catalytic  domain  of  MMP-2  with  excised  gelatin 
binding  domain  is  still  active  in  hydrolysis  of  synthetic 
peptides  (10,  56).  Substrates  interact  with  the  loops 
designated  X,  $,  and  Z  at  unprimed  and  primed  por¬ 
tions,  and  structural  variability  of  these  loops  provide 
the  diversity  of  such  interactions. 


CONCLUSION 

The  foregoing  examined  sequence  similarities,  se¬ 
quence  alignments,  and  structural  aspects  in  arriving 
at  an  understanding  of  the  important  functions  for 
the  family  of  matrix  metalloproteinases.  Our  analysis 
of  the  structures  of  MMPs  in  view  of  their  evolution¬ 
ary  relationship  follow  the  limitations  of  the  primary 
sequence  alignment  and  the  prediction  of  the  3-di¬ 
mensional  structures,  but  present  a  distinctive  way  of 
looking  at  the  multidomain  structures  like  those  of 
MMPs.  The  fact  that  alignment  of  the  entire  se¬ 
quences  and  those  for  the  catalytic  domains  and  the 
hemopexin-like  domains  produced  essentially  the 
same  numbers  for  the  clusters  (22-23  clusters)  and 


that  the  composition  for  the  clusters  appear  to  be  the 
same  in  each  case  is  not  coincidental.  What  this  re¬ 
veals  to  us  is  the  likely  scenario  that  domain  assem¬ 
blies  occurred  in  an  early  stage  of  the  diversification 
of  these  enzymes  and  that  they  progressed  through 
the  evolutionary  process  independent  of  one  an¬ 
other,  and  perhaps  in  parallel  to  each  other.  This  fact 
does  not  divorce  itself  from  the  obvious  premise  that 
at  some  primordial  point  in  the  evolution  of  these 
enzymes  they  must  have  existed  as  simple  single-do- 
main  proteins  that  underwent  gene  fusions  to  gen¬ 
erate  the  more  complicated  multidomain  enzymes. 
This  point  is  perhaps  best  underscored  by  the  ex¬ 
amples  of  the  three  plant  MMPs  and  the  sole  enzyme 
from  nematode.  These  clearly  are  modem  variants  of 
enzymes  that  did  not  undergo  major  structural  elab¬ 
oration  in  their  development.  However,  our  analysis 
also  demonstrates  that  there  are  examples  where  evo¬ 
lution  progressed  in  the  reverse  direction:  a  more 
complicated  multidomain  enzyme  underwent  trun¬ 
cation  in  its  gene  sequence  to  give  rise  to  a  less  elab¬ 
orated  protein  of  fewer  domains.  An  example  of  this 
type  of  retrograde  process  is  matrilysin  (MMP-7), 
which  contains  only  the  signal  peptide,  the  propep¬ 
tide,  and  the  catalytic  domain. 

It  is  not  clear  how  many  more  MMPs  exist  in  na¬ 
ture,  and  our  understanding  of  the  actual  functions 
of  these  enzymes  is  now  at  a  rudimentary  stage.  As 
more  sequences  of  MMPs  become  available,  the  anal¬ 
ysis  presented  here  should  be  updated  and  correlated 
with  the  new  structural  information  that  will  be  de¬ 
termined  for  these  important  enzymes.  Nonetheless, 
the  exercise  presented  here  is  the  first  step  toward 
appreciation  of  the  evolutionary  processes  that  led  to 
the  diversification  of  these  enzymes,  with  the  at¬ 
tendant  myriad  of  activities  of  central  importance 
to  both  the  physiology  and  pathology  of  living 
organisms.  ^ 
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.  ,  ■  e  ^  calcium  ions  in  matrix  metalloproteases  (MMPs)  were 

StrUCWral  ^  ^  M  rePreSentatiVe  MMPS' 

physiological  and  pahologicel  tactions.  Thes tac  ^pendent  ondopsptidases  town  ta 

inflammation,  arthritis,  and  cancer  metastasis.  Th«e  ”,  „  ^  ptamns.  MMPs  have  developed 
ft*  ability  to  cleave  eta-cellular  mama  ^  b  «.  iuconKtoon  °f  ™rioos  P'°“"  d°m"“S 

hato  a  unique  gt.up  of  ainedependen.  ^  ^s  The  implication  of  MMPs  ,n  ctrncet 

in  their  structures,  which  mediate  interactions  with  s  ^  ^  ^  ^  ^  ^  represent  Mttaenve 

metastasis  to  angiogenesis  has  »sed  eons'  «  undemanding  of  the  structure  and  funenon  o 


Figure  1.  Stereo  *e  catalytiC 

SSTX ^^^.ncionandtHeorange 

sphere  represents  the  calcium  ion.  human.  These  human  enzymes 

A.  total  of  64  MMPs  have  been  sequenced  to  ^  )>sen  identified  in  invertebrates4'6  and  three 

have  counterparts  in  other  vertebrates.  general>  posseSs  a  propeptide  domain  (N-term,nal)^ 

have  recently  been  sequenced  from  P^1  S0U^*S'  except  for  MMP-7  which  lacks  it).8  MMP-  an 

a  catalytic  domain  and  a  hemopexin-hke  domain  ^  ^  have  acquired 

further  possess  a  fibronectin-like  ™  ^  ^  ?inc  ions  and  at  least  one  calcium  ion  coordinate 

transmembrane  domain.  In  the  ^,c  J-  ^  ^  ^  ^  *  *  active  site,  and  is  intimately  invo  ve 
to  various  residues  (Figure  1).  One  or 
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the  catalytic  processes  of  these  enzymes.  The  second  zinc  ion  (structural  zinc)  and  the  calcium  ion  are  present  in 
the  catalytic  domain  in  proximity  to  the  catalytic  zinc  (approximately  12  A  away).  The  catalytic  zinc  ion  is 
essential  for  the  proteolytic  activity  of  MMPs  and  the  three  histidine  residues  that  coordinate  with  the  catalytic  zinc 
are  conserved  among. MMPs.  However,  there  is  little  known  about  the  roles  of  the  second  zinc  ion  and  the 
calcium  ion  within  the  catalytic  domain.  It  was  noted  that  the  MMPs  have  high  affinities  for  the  structural  zinc 
and  the  calcium  ions.9,10  It  has  been  suggested  that  these  metal  ions  keep  the  structural  elements  together  in  the 
catalytic  domains  of  the  MMPs,  contributing  to  its  stability.11'13  We  have  compared  amino-acid  sequences  of  64 
MMPs  from  various  sources.  Whereas  the  catalytic  zinc  and  its  coordinating  residues  are  well  conserved, 
structural  zinc  and  the  calcium  ions  show  differences  in  their  corresponding  binding  motifs.  In  an  effort  to 
understand  the  coordination  patterns  of  the  structural  zinc  and  calcium  ions  in  the  catalytic  domains  of  various 
MMPs,  we  have  modeled  the  three-dimensional  structures  of  several  representative  members  of  the  MMPs.  A 
clear  insight  into  these  coordination  sites  is  important  due  to  the  fact  that  the  structural  zinc-binding  motif  as  well 
as  the  calcium-binding  motif  are  unique  to  the  matrix  metal loproteases  and  are  not  present  in  other  metzincin 
proteases  like  astacin  and  adamalysin.11  This  comprehensive  analysis  of  the  three-dimensional  models  of  the 
structural  zinc  and  calcium  ion  binding  regions  has  been  performed  here  for  the  first  time  to  expand  our 
knowledge  of  this  important  family  of  enzymes. 

Amino-acid  sequences  of  MMPs  were  obtained  from  the  GenBank,  TREMBL,  and  SwissProt  data  banks. 
We  utilized  a  total  of  64  MMP  sequences  from  various  sources  for  the  multiple-sequence  alignment  and  then,  a 
separate  analysis  was  run  using  the  representative  23  MMP  sequences  ( vide  infra).  The  human  enzymes  were 
selected  where  possible.  Four  human  MMPs,  fibroblast  (MMP-1,  leg!)12  and  neutrophil  (MMP-8,  lmnc)14 
collagenases,  matrilysin  (MMP-7,  Immq),15  and  stromelysin-1  (MMP-3,  Islm)16  have  recently  been  crystallized. 
The  17  modeled  MMPs  are  the  human  MMP- 10,  MMP-1 1,  MMP- 12,  MMP- 13,  MT1-MMP  (MMP- 14),  MT2- 
MMP  (MMP- 15),  MT3-MMP  (MMP- 16),  MT4-MMP  (MMP- 17),  MMP- 19  (same  as  MMP- 18),  pig  enamelysin, 
sea  urchin  envelysin  (SwissProt  accession  number  P22757),  stromelysin-like  MMP  from  newt  (GenBank 
accession  number  D82053),  collagenase-4  from  frog,  nematode  MMP  (GenBank  accession  number  U00038), 
chicken  CMMP,  frog  XMMP,  and  MMP  from  mouse-ear  cress  (TREMBL  accession  number  004529).  The 
multiple-sequence  alignments  were  performed  using  the  program  PileUp  from  the  Wisconsin  package  version  9. 
We  used  the  existing  X-ray  crystal  structural  information  to  predict  the  three-dimensional  structures  for  the 
catalytic  domains  of  17  homologous  metal  loproteases  using  the  program  COMPOSER.17  Water  molecules  that 
occupy  the  average  positions  in  the  crystallized  MMPs  and  the  two  zinc  and  one  calcium  ions  were  added  to  the 
folded  structures,  and  the  entire  complexes  were  allowed  to  undergo  10000  cycles  of  energy-minimization 
according  to  the  methodology  reported  by  us  earlier.18  We  have  reported  recently  on  such  computational 
structures  for  the  catalytic  domains  of  gelatinases  A  (MMP-2)  and  B  (MMP-9).18  After  modeling  the  catalytic 
domains  of  the  17  MMPs,  we  utilized  this  information  in  conjunction  with  the  four  X-ray  crystal  structures  of 
MMPs  and  previously  modeled  MMP-2  and  MMP-9  structures  to  compare  the  structural  zinc-binding  and 
calcium-binding  regions. 

Figure  2  shows  the  multiple-sequence  alignment  for  the  structural  zinc-binding  and  calcium-binding  sites 
of  the  catalytic  domains  of  the  representative  23  MMPs.  These  metal-binding  sites  are  formed  by  (3-strands  and 
turns.  Analysis  of  the  second  zinc-binding  site  revealed  that  the  MMPs  developed  at  least  four  different  motifs  for 
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binding  to  this  structural  zinc  ion.  For  the  majority  of  MMPs  (except  for  MMP-11,  MMP-17,  CMMP  and 
XMMP),  this  site  consists  of  the  side  chains  of  an  aspartic  acid  and  three  histidine  residues;  which  are  consistent 
with  the  existing  crystal  structures.  Two  of  the  coordinating  histidine  imidazoles  are  bound  as  His5  and  one  as 
Hise  (Figure  2;  the  three-dimensional  structure  of  this  site  is  shown  in  Figure  3A).  The  signature  for  this  site  in 
the  60  MMPs  is  the  sequence(s)  //^[GN]-D-X(2)-[PAS]-F-D-[GA]-X(4)-[LIRV]~[AG]-/ir^[AV]-[FYS]-P- 
X(5,7,9)-tf£-[FL]~D-X(2)-E-X-W,  where  residues  that  provide  side  chains  for  coordination  to  zinc  and  calcium 
are  shown  in  bold  italics,  X  (5,7,9)  indicates  5,  7,  or  9  variable  residues  in  between  the  flanking  sites  (H6is  a 
histidine  protonated  at  NS1  and  H£  is  a  histidine  protonated  at  Ne2).  The  first  phenylalanine  and  the  second 
phenylalanine/tyrosine  (i.e.,  the  FYS  sequence)  in  this  pattern  are  brought  close  to  the  coordinated  histidines  by 
the  enzyme  fold.  This  creates  an  increased  hydrophobic  environment,  which  presumably  would  enhance  the 
binding  affinity  for  the  metal  ions.  The  human  MMP-1 1  has  an  aspartic  acid  instead  of  one  of  the  His6.  This  is  in 
contrast  to  the  MMP-11  from  rabbit  and  mouse,  which  still  have  a  histidine  at  this  position  (data  not  shown). 
Therefore,  we  conclude  that  MMP-1 1  (compared  to  those  from  human,  rabbit  and  mouse)  does  not  have  a  unique 
motif  for  binding  of  the  structural  zinc. 
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MMP-1  human 
MMP-2  human 
MMP-3  human 
MMP-7  human 
MMP-8  human 
MMP-9  human 
MMP-10  human 
MMP-11  human 
MMP-1 2  human 
MMP-13  human 
MT1-MMP  human 
MT2-MMP  human 
MT3-MMP  human 
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MMP19  human 
Collagenase-4  frog 
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DIMISFVRGDHRDNSP-FDG.PGGNLAHAFQPGPGIGGD-AHFDEDERWTNN - 

DIMINFGRWEHGDGYP-FDS.KDGL.LAHAFAPGTGVGGD-SHFDDDELWTLGEGQVVRVKY 

DIMISFAVREHGDFYP-FD(LPGNVLAHAYAPGPGINGD-AHFDDDEQWTKD - 

D I M I G  F ARG AH  GD  S  Y  P - FD  G.PGNTL AH AF AP  GT  GLGG  D - AH  FD  E  DE  RWT DG - 

DINIAFYQRDHGDNSP-FDG.PNGXLAHAFQPGQGIGGD-AHFDAEETWTNT - 

DIVIQFGVAEHGDGYP-FD&KDGLLAHAFPPGPGIQGD-AHFDDDELWSLGKGVVVPTRF 

dimisfavkehgdfys-fdgpghslahayppgpglygd-ihfdddekwted - 

DIMIDFARYWDGDDLP-FDG.P£GILAHAFFPKTHREGD-VHFDYDETWTIGDDQ - 

DILVVFARGAHGDFHA-FDG.KGGILAHAFGPGSGIGGD-AHFDEDEFWTTH - 

DIMISFGIKEHGDFYP-FDG.PSGLLAHAFPPGPNYGGD-AHFDDDETWTSS - 

dimiffaegfhgdstp-fdgeggflahayfpgpniggd-thfdsaepwtvrnedl - 

DIMVLFASGFHGDSSP-FD£_TGGFLAHAYFPGPGLGGD-THFDADEPWTFSSTDL - 

DIPIIFASGFHGDSSP-FDGEGGFLAHAYFPGPGIGGD-THFDSDEPWTLGNPNH - 

DIQIDFSKADHNDGYP-FDA -RRHRAHAFFPGHHHTAGYTHFNDDEAWTFRSSDA - 

DIRLSFHGRQSS  YCSNTFDG_PGRVLAHADIPELG - SVHFDEDEFWTEGTY - 

DIEISFTAGDHKDNSP-FDG.SGGXLAHAFQPGNGIGGD-AHFDEDETWTKT - 

DIQI SFGAREHGDFNP-FDC^PXGTLAHAFAPGTGIGGD-AHFDEDEKWSKV - 

DIMVAFGTKAHGHCPRYFDG.PLGVLAHAFPPGSGFGGD-VHFDEDEDWTMG - 

DIMISFETGDHGDSYP-FDS.PRGTLAHAFAPGEGLGGD-THFDNAEKWTMG - 

DIRIKFGSYDHGDGIS-FDG.RGGVLAHAFLPRNG - DAHFDDSETWTEGTR - 

DIYIAFEKGEHSDGFP-FDG_QDGVVAHAFYPRDG - RLHFDAEEQWSLNSV - 

DIKLGFGRGRHLGCSRAFDG.SGQEFAHAWFLGD - IHFDDDEHFTAPS - 

DITIGFYTGDHGDGEP-FDG.VLGTLAHAFSPPSG - KFHLDADENWVVSGDL - 
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Figure  2.  Multiple-sequence  alignment  of  the  structural  zinc  and  calcium  binding  regions  in  the  catalytic 
domains  of  23  representative  MMPs.  The  structural  zinc  binding  residues  in  majority  of  MMPs  are  given  in  blue 
-  three  histidines  and  one  aspartic  acid;  the  structural  zinc  binding  domain  in  MMP- 19,  CMMP,  and  XMMP  are  in 
green.  The  residues  that  coordinate  to  the  calcium  ion  are  in  pink;  “underlined”  amino  acids  chelate  to  the  calcium 
ion  via  their  backbone  carbonyl  moiety;  the  variation  which  was  observed  in  the  calcium  binding  site  of  MMP-17 
(MT4-MMP)  is  given  in  cyan. 
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The  remaining  MMPs,  those  of  chicken  CMMP,  the  frog  XMMP,  and  the  human  MMP- 19  (Figure  2), 
each  possess  unique  motifs  for  binding  to  the  structural  zinc  ion.  The  chicken  CMMP  has  a  histidine  where  the 
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Figure  3.  Stereo  views  of  the  structural  zinc 
inding  motif  in  the  catalytic  domain  of  the 
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Analysis  of  the  structure  for  MMP- 19  showed  that  this  enzyme  has  two  histidines  for  metal  coordination 
at  the  usual  places  and  one  cysteine  at  the  corresponding  position  already  discussed  in  the  sequences  for  CMMP 
and  XMMP.  However,  MMP-19  possesses  a  nomcoordinating  serine  (Ser-163)  at  the  position  where  the 
majority  of  MMPs  have  the  third  histidine.  Interestingly,  the  third  coordinating  histidine  (His-159  Figure  3E)  in 
this  enzyme  is  provided  at  an  entirely  distinct  position,  four  residues  to  the  N-terminus.  Whereas  the  position  of 

the  amino  acid  is  different  and  indeed  on  a  different  P-strand,  it  provided  the  side  chain  for  the  fourth  coordination 

site  in  a  perfectly  acceptable  orientation  in  space  (for  example,  compare  the  positions  of  His-159  and  His-270 
Figures  3E  and  3D,  respectively).  .  m 

Binding  of  the  calcium  ion  brings  six  specific  elements  of  MMP  catalytic  domain  for  coordination  in  an 
octahedral  fashion.  Three  of  these  elements  are  the  side  chains  of  two  aspartic  acids  and  one  glutamic  acid  whi  h 
are  conserved  in  all  23  MMPs  except  in  MT4-MMP  {vide  infra,  Figure  2).  The  remaining  three  calcium  ligands 
are  provided  by  the  backbone  carbonyl  oxygens  of  three  residues  within  a  turn  made  up  of  five  amino  acids 
(Figure  2).  A  typical  calcium-binding  motif  is  shown  in  Figure  4A.  The  only  exception  to  this  general  picture 
would  appear  to  be  MT4-MMP  (MMP- 17),  which  has  undergone  one  amino  acid  deletion  in  this  turn  (i  e  it  h 
a  four-amino  acid  turn).  In  human  MMP-17  (MT4-MMP),  an  asparagine  residue  is  observed  instead’of  the 

second  aspartic  acid  (Figure  2).  The  coordination  to  the  calcium  ion  for  MT4-MMP  (MMP-17)  is  shown  ' 
Figure  4B.  in 


Figure  4.  Stereo  views  of  the  calcium  binding 
that  for  MMP-17  (B). 


in  majority  of  MMPs,  except  for  MMP-17  (A); 


In  conclusion,  there  are  at  least  four  different  motifs  for  the  binding  to  the  structural  zinc  ion  that  are 
discernible  from  the  three-dimensional  structure  analysis.  However,  the  calcium  binding  motif  is  more  strictly 
conserved.  These  variations  in  the  metal-binding  motifs  preserve  the  topology  of  the  structural  zinc-binding  site 
as  well  as  the  calcium-binding  site,  and  hence  the  general  fold  for  the  catalytic  domain  is  highly  preserved.  The 
second  zinc  binding  site  is  present  in  all  MMPs  and  is  important  for  the  activity  of  these  enzymes.  Hence  the 
structural  variations  that  are  noted  here  in  the  coordination  to  the  structural  zinc  ion  may  be  indicative  of  the 
different  outcomes  for  selection  of  novel  enzymic  activities.  Furthermore,  dendrogram  analysis  of  the  catalytic 
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domain  sequence  alignment  for  MMPs  indicates  that  these  variations  in  the  zinc-binding  motif  came  about  as  a 
consequence  of  independent  evolutionaiy  processes,  unrelated  to  one  another.20  Since  these  motifs  are  absent  in 
other  members  of  the  metzincin  family  of  enzymes,  we  venture  to  say  that  the  existence  of  these  metal-binding 
sites  in  MMPs  must  have  arisen  in  response  to  specific  needs  unique  to  MMPs  with  reference  to  their  substrate 
specificities.  The  information  provided  herein  is  intended  to  stimulate  interest  in  exploring  the  role  of  these  motifs 
in  MMPs  geared  toward  understanding  the  substrate  specificities  for  these  enzymes,  which  are  not  understood  to 
any  appreciable  degree  at  the  present.  Our  understanding  of  the  actual  functions  of  the  MMPs  is  at  a  rudimentary 
stage  at  the  present  and  as  more  sequences  of  MMPs  become  available,  the  analysis  presented  here  should  be 
updated  and  correlated  with  the  new  stnictural  information  to  shed  light  on  the  various  properties  of  these 
important  enzymes. 
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